Locating Gene/Protein Information January 11, 2011 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services Health Sciences Library System University of Pittsburgh [email protected] http://www.hsls.pitt.edu/molbio


Locating Gene/Protein Information

January 11, 2011

Ansuman Chattopadhyay, PhD
Head, Molecular Biology Information Services
Health Sciences Library System
University of Pittsburgh
[email protected]

http://www.hsls.pitt.edu/molbio


Objectives

Generate a gene list

Mine gene/protein information


Topics

Literature Informatics

Gene / Protein Information Gateways

Search Engine for MolBio / Bioinformatics Databases and Software


Topics

Literature Informatics

• Comprehensive search:
    MeSH term-based PubMed search
    PubMed special topic queries
• Next-generation literature search tools:
    GoPubMed
    GLAD4U
    HuGE Navigator

http://www.hsls.pitt.edu/guides/genetics


Literature Informatics

Which genes/proteins are reported to be associated with the disease schizophrenia?

Citations: 19 million
Journals: 5,200

Schizophrenia: 86,384 .. 96,234
Schizophrenia gene: 5,851 … 7,295


Challenges in Literature Search

Am I getting everything?
Too much information: how to digest it?

A list with citations


Medical Subject Headings (MeSH)


The U.S. National Library of Medicine's controlled vocabulary (thesaurus)

Arranged in a hierarchical manner called the MeSH Tree Structures

Updated annually


MeSH Vocabulary

• Headings: over 24,000, representing concepts found in the biomedical literature (Body Weight, Kidney, Radioactive Waste)
• Subheadings: attached to headings to describe a specific aspect of a concept (adverse effects, metabolism, diagnosis, therapy)
• Supplementary Concept Records: over 172,000 terms in a separate chemical thesaurus, updated weekly (cordycepin, valspodar, tacrolimus binding protein 4)
• Publication Types (Letter, Review, Randomized Controlled Trial)


MeSH Tree Structure

A. Anatomy
B. Organisms
C. Diseases
D. Chemicals and Drugs
E. Analytical, Diagnostic and Therapeutic Techniques and Equipment
F. Psychiatry and Psychology
G. Biological Sciences
H. Physical Sciences
I. Anthropology, Education, Sociology and Social Phenomena
J. Technology and Food and Beverages
K. Humanities
L. Information Science
M. Persons
N. Health Care
V. Publication Characteristics
Z. Geographic Locations
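MeSH tree numbers begin with the letter of the top-level category they belong to, so the category list above can serve as a simple lookup table. The following is a minimal sketch in Python; the function name `mesh_category` and the tree numbers in the usage note are illustrative assumptions, not taken from the slides.

```python
# Top-level MeSH categories as listed on the slide (letter -> category name).
MESH_CATEGORIES = {
    "A": "Anatomy",
    "B": "Organisms",
    "C": "Diseases",
    "D": "Chemicals and Drugs",
    "E": "Analytical, Diagnostic and Therapeutic Techniques and Equipment",
    "F": "Psychiatry and Psychology",
    "G": "Biological Sciences",
    "H": "Physical Sciences",
    "I": "Anthropology, Education, Sociology and Social Phenomena",
    "J": "Technology and Food and Beverages",
    "K": "Humanities",
    "L": "Information Science",
    "M": "Persons",
    "N": "Health Care",
    "V": "Publication Characteristics",
    "Z": "Geographic Locations",
}


def mesh_category(tree_number: str) -> str:
    """Return the top-level category for a MeSH tree number (hypothetical helper)."""
    letter = tree_number.strip()[:1].upper()
    if letter not in MESH_CATEGORIES:
        raise ValueError(f"Unknown MeSH category letter in {tree_number!r}")
    return MESH_CATEGORIES[letter]
```

For example, `mesh_category("C01.100")` returns "Diseases" and `mesh_category("f03.700")` returns "Psychiatry and Psychology"; both tree numbers are made up for illustration.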


MeSH Indexing


Source: NLM


Genes/Chemicals

MeSH Terms


PubMed Query Using MeSH http://www.ncbi.nlm.nih.gov/mesh


Find articles on “Dengue outbreaks in India” by searching PubMed using MeSH terms

Link to the video tutorial: http://media.hsls.pitt.edu/media/molbiovideos/pubmedsearch1.swf

Resources

• MeSH Browser: http://www.ncbi.nlm.nih.gov/mesh
• PubMed: http://www.ncbi.nlm.nih.gov/pubmed


Building PubMed Queries

Term     Boolean  Term                                     Boolean  Term   # papers
Dengue   AND      Outbreaks                                                823
Dengue*  AND      Outbreaks                                                746
Dengue   AND      Outbreaks                                AND      India  131
Dengue*  AND      Outbreaks                                AND      India  116
Dengue   AND      Outbreaks/statistics and numerical data  AND      India  7
Dengue*  AND      Outbreaks/statistics and numerical data  AND      India  7
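The rows above combine terms with Boolean AND and, in the last two rows, attach a subheading to a heading with a slash. A minimal Python sketch of how such query strings could be assembled is shown below. The helper names are hypothetical, and using "Disease Outbreaks" as the MeSH heading for the slide's "Outbreaks" is an assumption; the code only builds strings and does not reproduce the paper counts.

```python
def pubmed_term(heading, subheading=None, field="MeSH Terms"):
    """Format one field-tagged PubMed term, optionally as heading/subheading."""
    text = f"{heading}/{subheading}" if subheading else heading
    return f'"{text}"[{field}]'


def and_query(*terms):
    """Join already formatted terms with Boolean AND."""
    return " AND ".join(terms)


# Narrowing a search step by step, in the spirit of the table.
broad = and_query(pubmed_term("Dengue"), pubmed_term("Disease Outbreaks"))
narrow = and_query(
    pubmed_term("Dengue"),
    # Heading name is an assumption; the subheading text is from the slide.
    pubmed_term("Disease Outbreaks", "statistics and numerical data"),
    pubmed_term("India"),
)
```

Each added AND clause can only keep or reduce the number of matching citations, which is the pattern the "# papers" column shows.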


Useful links for MeSH

• MeSH Browser: http://www.ncbi.nlm.nih.gov/mesh
• Links to Wikipedia, YouTube videos, blogs, etc. on “medical subject heading”: http://www.kosmix.com/topic/Medical_Subject_Headings?
• “18 ways to improve your PubMed searches” by Carrie Iwema: http://bitesizebio.com/2008/03/05/18-ways-to-improve-your-pubmed-searches/
• “Searching by using the MeSH Database”, NCBI Handbook: http://www.ncbi.nlm.nih.gov/bookshelf/br.fcgi?book=helppubmed&part=pubmedhelp#pubmedhelp.Searching_by_using_t


Find genes that are reported to be associated with the disease SCHIZOPHRENIA by searching PubMed

Link to the video tutorial: http://media.hsls.pitt.edu/media/molbiovideos/pubmedsearch2.swf

Resources

• PubMed Clinical Queries: http://www.ncbi.nlm.nih.gov/pubmed/clinical


Topic-Specific PubMed Queries http://www.nlm.nih.gov/bsd/special_queries.html


Research on Optimal Search Strategies


PubMed Special Topic Queries


Search Filters


PubMed Search Filter: Medical Genetics

("schizophrenia"[MeSH Terms] OR "schizophrenia"[All Fields]) AND (("genetics, medical"[MeSH Terms] OR ("genetics"[All Fields] AND "medical"[All Fields]) OR "medical genetics"[All Fields] OR ("medical"[All Fields] AND "genetics"[All Fields])) OR ("genotype"[MeSH Terms] OR "genotype"[All Fields]) OR "genetics"[Subheading] AND ("genetics"[Subheading] OR "genetics"[All Fields] OR "genetics"[MeSH Terms]))

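A field-tagged query like the Medical Genetics filter can also be sent to PubMed programmatically through NCBI's E-utilities `esearch` endpoint. The sketch below only builds the request URL and performs no network call. The helper name `esearch_url` and the shortened query in the usage note are illustrative assumptions; the full filter from the slide can be passed in the same way.

```python
from urllib.parse import urlencode, urlparse, parse_qs

# Base URL of the NCBI E-utilities ESearch service.
ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def esearch_url(term, db="pubmed", retmax=20):
    """Build an ESearch URL for a query string (hypothetical helper)."""
    params = {"db": db, "term": term, "retmax": retmax, "retmode": "json"}
    # urlencode percent-escapes quotes, brackets and spaces in the query.
    return f"{ESEARCH}?{urlencode(params)}"


# Shortened stand-in for the slide's filter, used only for illustration.
query = '"schizophrenia"[MeSH Terms] AND "genetics"[Subheading]'
url = esearch_url(query)

# Decoding the URL again recovers the original query text.
decoded = parse_qs(urlparse(url).query)["term"][0]
```

Opening `url` in a browser, or fetching it with any HTTP client, should return the matching PubMed IDs; the count will differ from the figures on the slides because PubMed has grown since 2011.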

PubMed Search Result Display


Latest Innovations in Literature Searching

GoPubMed: displays search results sorted into meaningful topics and subtopics


GoPubMed


www.gopubmed.com


Find genes that are reported to be associated with the disease SCHIZOPHRENIA by using GoPubMed

Link to the video tutorial: http://media.hsls.pitt.edu/media/clres2705/gopubmed.swf

Resources

• GoPubMed: http://www.gopubmed.org/web/gopubmed/2?WEB10O00h00100090000


GoPubMed Search Result


GoPubMed Search Result Analysis


Latest Innovations in Literature Searching


PubMed driven Web Tools

http://www.ncbi.nlm.nih.gov/CBBresearch/Lu/search/


Literature to Gene List

GLAD4U http://bioinfo.vanderbilt.edu/glad4u/


Gene list to common functions


Literature to Gene list


http://www.quertle.info/


NIH Grant Applications to Gene List


Curated Molecular Databases


Molecular Databases

• Nucleic Acids Research (NAR): annual Database Issue
• NAR: annual Web Server Issue
• Oxford Journals: Bioinformatics
• BioMed Central: BMC Bioinformatics

Growth of bioinformatics tools


Growth of Molecular Databases

Source: Nodal Point Blog

2008: 1078


2011: 1330


GWA Studies Catalog


http://www.genome.gov/gwastudies/


Search Engine Just for Human Genetics

CDC HuGE Navigator: http://hugenavigator.net/


Find human genes that are reported to be associated with asthma

Find human SNPs that are reported to be associated with asthma

Link to the video tutorial: http://media.hsls.pitt.edu/media/clres2705/asthma.swf

Resources

• HuGE Navigator: http://hugenavigator.net/HuGENavigator/home.do


Search Engine Just for Human Genetics

http://hugenavigator.net/HuGENavigator/huGEPedia.do


Find Disease Causing SNPs

What SNPs are associated with “Schizophrenia”?

http://hugenavigator.net/HuGENavigator/gWAHitStartPage.do


Hands-on exercise: literature search

Which proteins are related to Alzheimer’s disease?

Where are the leading centers, and who are the leading scientists, for liver transplantation?

Which hormones are associated with Autistic Disorder?


Gene/Protein Information Mining


Bioinformatics Databases & Software Providers

• National Center for Biotechnology Information (NCBI): Home page, Site map, Resource Guide
• European Bioinformatics Institute (EBI): Home page, Databases, Software


Gene Information Gateways

Open access resources:

• National Center for Biotechnology Information (NCBI)
    GenBank
    RefSeq
    Entrez Gene
    Gene Expression Omnibus (GEO)
    OMIM


Protein Information Hubs

Open access resources:

• European Bioinformatics Institute (EBI)
    UniProt
    InterPro
    PROSITE
    STRING
• UCSC Genome Bioinformatics
    BLAT Search
    Gene Detail Page


Protein Information Hubs

Open access resources:

• National Center for Biotechnology Information (NCBI)
    RefSeq
    Entrez Gene
    Conserved Domain Database (CDD)
    Molecular Modeling Database (MMDB)
    3D structure viewer: Cn3D


Gene/Protein Information

Chromosomal location, mRNA, genomic sequence, orthologs, paralogs, regulatory elements

Amino acid sequence, domain architecture, protein structure, post-translational modifications

Gene expression, biological pathways, protein interaction map, disease association, biomarkers


Gene Questions

What is its function?
What are its neighboring genes?
What is its genomic sequence?
How many splice variants are there?
What is its intron-exon architecture?
What diseases are associated with it?
In which tissues is it expressed?
How can I get its cDNA clone?


NCBI: Entrez Gene

SNP
Genomic sequence
Expression profile
Interacting partners
3D structure
mRNA sequence
Chromosomal localization
Disease
Amino acid sequence
Homologous sequences


Entrez Gene

Find:
• gene symbols and aliases
• sequences: genomic, mRNA, protein
• intron-exon architecture
• genomic context: neighboring and antisense genes
• interacting partners
• associated Gene Ontology terms: function, cellular component and biological process


Entrez Gene

• A searchable database of genes, from RefSeq genomes, defined by sequence and/or located in the NCBI Map Viewer
• Each record represents a single gene from a given organism
• Statistics:
    Gene: 7,974 organisms
    GenBank: 160,000 organisms


NCBI Sequence Databases

• GenBank: archival database of nucleotide sequences from >160,000 organisms
• GenPept: conceptual translation of GenBank CDS
• RefSeq: based on GenBank records; non-redundant, expert-verified databases of reference sequences


International Nucleotide Sequence Database Collaboration


Primary Vs Derivative databases


RefSeq Scope & Accessions

Genomic DNA
    NC_123456 - complete genome, complete chromosome, complete plasmid
    NG_123456 - genomic region
    NT_123456 - genomic contig
mRNA
    NM_123456
Protein
    NP_123456
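Because the two-letter prefix of a RefSeq accession encodes the molecule type, an accession can be classified without looking up the record. The following Python sketch covers only the five prefixes shown on this slide. The function name, the regular expression, and the wording of the fallback label are assumptions made for illustration; RefSeq defines further prefixes that are not handled here.

```python
import re

# Prefix meanings exactly as given on the slide.
REFSEQ_PREFIXES = {
    "NC": "genomic DNA: complete genome, complete chromosome, complete plasmid",
    "NG": "genomic DNA: genomic region",
    "NT": "genomic DNA: genomic contig",
    "NM": "mRNA",
    "NP": "protein",
}

# Two letters, underscore, digits, and an optional ".version" suffix.
_ACCESSION = re.compile(r"^([A-Z]{2})_(\d+)(?:\.\d+)?$")


def classify_refseq(accession):
    """Return the molecule type implied by a RefSeq accession prefix."""
    match = _ACCESSION.match(accession.strip().upper())
    if match is None:
        raise ValueError(f"Not a RefSeq-style accession: {accession!r}")
    # Prefixes outside the slide's list get an explicit fallback label.
    return REFSEQ_PREFIXES.get(match.group(1), "prefix not covered on the slide")
```

With the placeholder accessions from the slide, `classify_refseq("NM_123456")` returns "mRNA" and `classify_refseq("NP_123456")` returns "protein".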


RefSeq Status Codes

• Provisional
• Reviewed
• Predicted
• Genome Annotation


Hands-on

Find the mRNA sequence for your gene of interest (p53, BRCA1, EGFR, PLCg1)

Start page: Entrez CoreNucleotide
Use: Limits, History and Preview/Index


Video Tutorials

http://www.hsls.pitt.edu/molbio/videos?c=3


Find mRNA Sequence for Reelin Gene.


Gene Function

What is its function?

Entrez Gene page:
• Summary (TOC)
• Gene Ontology
• GeneRIFs
• Pathways (TOC)
• BioSystems (Links)


Gene Ontology (GO)

Controlled vocabulary tagging

• Function
• Biological Processes
• Cellular Component


Gene Ontology (GO) and KEGG

• GO information page
• GO evidence codes
• KEGG information page


Function

How many splice variants are there?
What is/are its sequence?

Entrez Gene page:
• Genomic regions… (TOC)
• UCSC (Links)

Video Tutorials


Alternative Splicing


Intron-Exon Coordinates

What is its intron-exon architecture?

Entrez Gene page:
• Display: change it from Full Report to Gene Table

Video Tutorials


Neighboring Genes

What are its neighboring genes?

Entrez Gene page:
• Genomic context (TOC)

Video Tutorials


Chromosomal location


Associated Diseases

What diseases are associated with it?

Entrez Gene page:
• TOC: General Information_Phenotype
• Links: OMIM, HuGE Navigator

Video Tutorials


HomoloGene

What are its homologous genes?

Entrez Gene page:
• Links: HomoloGene; change Display settings

Video Tutorials


Reagents

How can I get its cDNA clone? … antibodies? … siRNA?

Entrez Gene page:
• TOC: Additional Links
• Research Materials
• ExactAntigen

Video Tutorials


Protein Information Gateways


UniProtKB: Universal Protein Resource

A comprehensive, centralized protein information resource.

Developed by a consortium:
• European Bioinformatics Institute (EBI)
• Swiss Institute of Bioinformatics (SIB)
• Protein Information Resource (PIR)

Comprised of:
• Swiss-Prot: biologist-curated annotation data
• TrEMBL: computationally annotated data
• PIR-International Protein Sequence Database (PIR-PSD): the most comprehensive and expertly curated protein sequence database in the public domain for over 20 years

Funded by: NIH, NSF, the European Union and the Swiss Federal government

Links to wiki, YouTube, blogs and tweets: http://www.kosmix.com/topic/uniprot?

Tutorial video: http://www.youtube.com/watch?v=TCF3qWn7siI&feature=youtube_gdata


Protein Questions

What is its function?

Amino acid sequence?
  … molecular weight? isoelectric point (pI)?
  … post-translational modifications?
  … presence of domain/pattern/profile?
  … hydrophobicity?
  … homologs and orthologs? etc.

Structure?
  … secondary and tertiary?

Interaction partners?


Uniprot Video Tutorial

http://www.hsls.pitt.edu/molbio/videos/play?v=19


Protein Function from UniProtKB

UniProt search:

Look under: general annotation_Function, ontologies_keywords, gene ontology


Protein Sequence

UniProt
• Sequence annotations
• Sequences

Gene
• Genomic regions, transcripts, and products
• CCDS (consensus CDS report)

UCSC
• Sequence and links


Protein Sequence Analysis

PTM
• UniProt > Seq annotation
• IPA > Modifications and Regulation

pI/MW
• UniProt > Seq_Tool > Compute pI

Hydrophobicity
• UniProt > Seq_Tool > ProtScale

Peptide Digest
• UniProt > Seq_Tools > PeptideMass, PeptideCutter

Homologous Seq
• Entrez Gene > HomoloGene

Domain/pattern
• UniProt > Sequence annotation > InterPro
• Entrez Gene > Conserved Domain
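Hydrophobicity tools of the kind listed above typically report a sliding-window average of a per-residue scale along the sequence. As a rough illustration of that idea, here is a minimal Python sketch using the Kyte-Doolittle hydropathy scale. The function name, the window sizes, and the test sequences are assumptions for illustration; this is not the code behind ProtScale.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=9):
    """Return the mean hydropathy of every full window along the sequence."""
    seq = sequence.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    # A KeyError here signals a non-standard residue letter.
    scores = [KYTE_DOOLITTLE[aa] for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]
```

Positive window averages point to hydrophobic stretches and negative ones to hydrophilic stretches; a wider window smooths the profile at the cost of resolution.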


Protein Domain Resources

Protein Domain Databases:

InterPro


Protein Domains

Wikipedia:

A protein domain is a part of protein sequence and structure that can evolve, function, and exist independently of the rest of the protein chain. Each domain forms a compact three-dimensional structure and often can be independently stable and folded. Many proteins consist of several structural domains. One domain may appear in a variety of evolutionarily related proteins. Domains vary in length from between about 25 amino acids up to 500 amino acids in length. The shortest domains such as zinc fingers are stabilized by metal ions or disulfide bridges. Domains often form functional units, such as the calcium-binding EF hand domain of calmodulin.


Protein Domain: SH3

Src homology 3 (SH3) domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs. They play a role in the regulation of enzymes by intramolecular interactions, in changing the subcellular localization of signal pathway components, and in mediating multiprotein complex assemblies.
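The PxxP motif mentioned above (proline, any two residues, proline) is simple enough to scan for with a regular expression. The sketch below is a minimal Python example; the function name and the test sequences are made up for illustration. A match only marks a candidate site, since real SH3 binding depends on more sequence and structural context than these four positions.

```python
import re

# Lookahead makes overlapping matches visible (e.g. PPLPP holds two motifs).
_PXXP = re.compile(r"(?=(P[A-Z]{2}P))")


def find_pxxp(sequence):
    """Return (0-based start, motif) for every PxxP occurrence."""
    seq = sequence.upper()
    return [(m.start(), m.group(1)) for m in _PXXP.finditer(seq)]
```

For the made-up sequence "APPLPPKP", `find_pxxp` reports three overlapping candidates starting at positions 1, 2 and 4.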


Protein Structure

Primary

Secondary

Tertiary

Quaternary


Useful links: http://www.kosmix.com/topic/protein_structure?

Taken from Wikipedia


Protein Structure


NCBI


Finding Protein Structure

PDB

Entrez Structure

NCBI BLink via Entrez Gene/Protein


Structure Databases and Viewers

Databases:

RCSB Protein Data Bank (PDB): State University of New Jersey (Rutgers), the San Diego Supercomputer Center at the University of California San Diego, the University of Wisconsin-Madison. Link: http://www.kosmix.com/topic/protein_data_bank?

MMDB: NCBI's structure database is called MMDB (Molecular Modeling DataBase), and it is a subset of three-dimensional structures obtained from the Protein Data Bank (PDB), excluding theoretical models.

Viewers:

Cn3D: a helper application for your web browser that allows you to view 3-dimensional structures from NCBI's Entrez retrieval service.

RasMol: EBI

FirstGlance in Jmol: a simple tool for macromolecular visualization.


Protein Structure

Search for the 3D structure of p53 in Entrez Structure

View the crystal structure of the mouse p53 core domain (MMDB: 42987) or the crystal structure of a p53 core dimer bound to DNA (PDB: 2GEQ)
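The same lookup can also be scripted. The sketch below only builds the request URLs and sends nothing over the network. It assumes the public NCBI E-utilities `esearch` endpoint with `db=structure` and the RCSB file-download URL pattern as they exist today, which are not necessarily what the 2011 slides demonstrated; the helper names are made up for illustration. Check the official documentation before relying on either URL.

```python
from urllib.parse import urlencode

# Assumed endpoint: NCBI E-utilities ESearch
EUTILS_ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def structure_search_url(term, retmax=5):
    """Build an ESearch URL that queries the NCBI Structure database."""
    params = {"db": "structure", "term": term, "retmode": "json", "retmax": retmax}
    return f"{EUTILS_ESEARCH}?{urlencode(params)}"


def rcsb_pdb_file_url(pdb_id):
    """Build the download URL for a legacy-format PDB file (assumed URL pattern)."""
    return f"https://files.rcsb.org/download/{pdb_id.upper()}.pdb"


print(structure_search_url("p53"))
print(rcsb_pdb_file_url("2geq"))
```

The printed URLs can be opened in a browser or fetched with `urllib.request.urlopen`; NCBI asks scripted users to keep request rates low.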


Manipulating the Structure Viewer Window


Find Similar Structure: NCBI VAST


NCBI BLink

BLink ("BLAST Link") displays the results of BLAST searches that have been done for every protein sequence in the Entrez Proteins data domain.

To access it, follow the BLink link displayed beside any hit in the results of an Entrez Proteins search.


Hands-on Protein Structure

View the crystal structure of Chronophin (PDB entry: 2P69).

A variant of this protein with mutations in its amino acid sequence has been isolated. Can you predict any effect of these mutations on its function?

Hint: Find the amino acid residues that are in close contact (within 3.5 Å) with pyridoxal-5'-phosphate (PLP).

Label the amino acids and save the picture in PNG format. Learn more about the Chronophin structure at: http://kb-dev.psi-structuralgenomics.org/KB/archives.jsp?pageshow=3
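The hint can also be worked through outside a structure viewer. The sketch below lists protein residues with any atom within a distance cutoff of a ligand, reading coordinates from legacy PDB-format text. Several things here are assumptions: the ligand residue name `PLP` (confirm it in the 2P69 entry), the fixed-column parsing, and all function names. The toy coordinates are invented and are not from 2P69.

```python
import math


def parse_atoms(pdb_text):
    """Yield (record, res_name, chain, res_seq, xyz) for ATOM/HETATM lines.

    Relies on the fixed-column layout of the legacy PDB format.
    """
    for line in pdb_text.splitlines():
        record = line[:6].strip()
        if record not in ("ATOM", "HETATM"):
            continue
        xyz = (float(line[30:38]), float(line[38:46]), float(line[46:54]))
        yield record, line[17:20].strip(), line[21], int(line[22:26]), xyz


def residues_near_ligand(pdb_text, ligand="PLP", cutoff=3.5):
    """Return sorted (chain, res_seq, res_name) within cutoff angstroms of the ligand."""
    atoms = list(parse_atoms(pdb_text))
    ligand_xyz = [xyz for rec, name, _, _, xyz in atoms
                  if rec == "HETATM" and name == ligand]
    near = set()
    for rec, name, chain, res_seq, xyz in atoms:
        if rec != "ATOM":
            continue  # skip waters, ions, and the ligand itself
        if any(math.dist(xyz, lig) <= cutoff for lig in ligand_xyz):
            near.add((chain, res_seq, name))
    return sorted(near)


def pdb_line(rec, serial, atom, res, chain, seq, x, y, z):
    """Format one toy coordinate record using PDB column widths."""
    return (f"{rec:<6}{serial:>5} {atom:<4} {res:>3} {chain}{seq:>4}    "
            f"{x:8.3f}{y:8.3f}{z:8.3f}")


# Toy coordinates, made up for illustration only
toy = "\n".join([
    pdb_line("ATOM", 1, "NZ", "LYS", "A", 10, 3.0, 0.0, 0.0),
    pdb_line("ATOM", 2, "CG", "ASP", "A", 20, 5.0, 0.0, 0.0),
    pdb_line("HETATM", 3, "C4A", "PLP", "A", 301, 0.0, 0.0, 0.0),
])
print(residues_near_ligand(toy))
```

To try it on the real entry, pass the text of a downloaded 2P69 PDB file instead of `toy`. Modified residues stored as HETATM records would be skipped by this sketch.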


Hands-on Protein Structure of Chronophin

http://kb-dev.psi-structuralgenomics.org/KB/archives.jsp?pageshow=3


Sequence Alignment in Cn3D

NCBI


Hands-On

Can you identify the human protein that contains the short peptide sequence GPDGMPVIYHGHTLTTKIKFSDVLHTIKE?

What is its function?

What are its calculated pI and molecular weight?

Which region of this protein is most hydrophobic?

Locate five experimentally verified S/T/Y phosphorylation sites present in this protein.

Find the mouse and fruit fly orthologs of this human protein and report the % protein identity it shares with them.

How many protein domains are reported to be present in this human protein? Find the location of its largest domain.
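For the hydrophobicity question, a sliding-window hydropathy profile is the usual approach. The sketch below uses the Kyte-Doolittle scale; the window size of 9 and the function names are choices made for this illustration, and the result may differ from ProtScale output if that tool is run with other settings.

```python
# Kyte-Doolittle hydropathy values (Kyte and Doolittle, 1982)
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=9):
    """Return the mean hydropathy of every window along the sequence."""
    scores = [KD[aa] for aa in sequence.upper()]
    return [sum(scores[i:i + window]) / window
            for i in range(len(scores) - window + 1)]


def most_hydrophobic_window(sequence, window=9):
    """Return (1-based start, window text, mean score) of the top-scoring window."""
    profile = hydropathy_profile(sequence, window)
    if not profile:
        raise ValueError("sequence is shorter than the window")
    best = max(range(len(profile)), key=profile.__getitem__)
    return best + 1, sequence.upper()[best:best + window], profile[best]


# Peptide from the exercise; run on the full protein sequence for a real answer
start, fragment, score = most_hydrophobic_window("GPDGMPVIYHGHTLTTKIKFSDVLHTIKE")
print(start, fragment, round(score, 2))
```

Residues outside the 20 standard amino acids raise a `KeyError` here, which is deliberate so that unexpected characters are not silently scored.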


Licensed Tools for Gene/Protein Information


HSLS Licensed Tools

• BioBase
• MetaCore
• Ingenuity IPA


Gene/Protein facts from Biobase


http://goo.gl/9wpwG


BioBase BioKnowledge Library


Protein Function from IPA


Search Engine for Bioinformatics Tools


Biomedical and Life Sciences Search Engines

OBRC : University of Pittsburgh

http://www.hsls.pitt.edu/guides/genetics/obrc

Vadlo

http://vadlo.com/

OReFil : University of Tokyo

http://orefil.dbcls.jp/


Search.HSLS.MolBio


http://www.hsls.pitt.edu/molbio


Search.HSLS.MolBio

Integrated search system:

• Databases & Software
• Articles on Databases & Software
• Genes/Proteins
• Pathways
• Protocols
• Seminar/Talks Videos
• Recommended Articles

Tabbed browsing

Clustered search results


Search term: “phosphorylation”


Molecular Databases and Software: search term “phosphorylation”


Search Result Page


Citation Trackers


Search PubMed Articles on Databases and Software: “phosphorylation”


Articles on Databases and Software


Articles on Prediction of Phosphorylation Sites


Prediction of Phosphorylation Sites


MetaPredPS


Clustering Remix


Genes/Proteins Info


Entrez Gene


BioBase Knowledge Library


Protocols:


Seminar Talks Video


Seminar Talks Video


Recommended Articles

Faculty of 1000 Biology: a literature awareness tool that highlights and reviews the most interesting papers published in the biological sciences, based on the recommendations of a faculty of well over 2300 selected leading researchers.


Faculty of 1000


Recommended Articles


Recommended Articles


Thank you! Any questions?

Carrie Iwema: [email protected], 412-383-6887

Ansuman Chattopadhyay: [email protected], 412-648-1297

http://www.hsls.pitt.edu/molbio