15
PIR: Protein Information Resource Tao Ma Georgia Institute of Technology 29 June, 2006

PIR: Protein Information Resource

Embed Size (px)

DESCRIPTION

Tao Ma Georgia Institute of Technology 29 June, 2006. PIR: Protein Information Resource. Content. Overview Major Modules Search and Analysis Tools Demo Pros and Cons. Overview. An integrated public resource of functional annotation of protein data - PowerPoint PPT Presentation

Citation preview

Page 1: PIR: Protein Information Resource

PIR: Protein Information Resource

Tao Ma

Georgia Institute of Technology

29 June, 2006

Page 2: PIR: Protein Information Resource

Content• Overview

• Major Modules

• Search and Analysis Tools

• Demo

• Pros and Cons

Page 3: PIR: Protein Information Resource

Overview

• An integrated public resource of functional annotation of protein data

• Support genomic/proteomic research and scientific discovery

• Provide PIRSF family classification system

• Provide iProClass integrated database of protein family, function, and structure

Page 4: PIR: Protein Information Resource

Major Modules

• UniProtUniversal Protein Resource

• iProClass Integrated Protein Classification

• iProLink Integrated Protein Literature, Information and Knowledge

• PIRSF PIR Super Family

Page 5: PIR: Protein Information Resource

UniProt Overview• The world’s most comprehensive catalog of information

on proteins

• Created by joining the information contained in Swiss-Prot, TrEMBL, and PIR

• Retrieve curated, reliable, comprehensive information on proteins

Page 6: PIR: Protein Information Resource

UniProt Structure

http://pir.georgetown.edu/pirwww/about/brochure.pdf

Page 7: PIR: Protein Information Resource

iProClass Overview

• Provide summary descriptions of protein family, function and structure for UniProt sequences

• Link to over 90 biological databases

• Comprise reports for all UniProtKB proteins

• Present comprehensive up-to-date information on proteins and protein data mapping

• Retrieve thorough information about a protein

Page 8: PIR: Protein Information Resource

iProClass Structure

http://pir.georgetown.edu/pirwww/about/brochure.pdf

Page 9: PIR: Protein Information Resource

iProLINK Overview

• Provide annotated literature, protein name dictionary and other information

• facilitate Natural Language Processing technology development

• Obtain literature source that describes protein entries

• Literature mining of protein phosphorylation

• Mapping protein/gene names to UniProtKB entries

• Text mining algorithm development using an annotated data set

Page 10: PIR: Protein Information Resource

iProLINK Structure

http://pir.georgetown.edu/pirwww/about/brochure.pdf

Page 11: PIR: Protein Information Resource

PIRSF Overview• A network with multiple levels of sequence diversity

• From superfamilies to subfamilies

• The primary PIRSF classification unit is the homeomorphic family

Homologous

Homeomorphic

• Manual curation for membership, etc.

• Retrieve reliable curated information for your protein sequence

Page 12: PIR: Protein Information Resource

Search and Analysis Tools• Text Search

• Batch Retrieval

• BLAST Search

• FASTA Search

• Related Sequence

• Peptide Match

• Pattern Match

• Multiple Alignment

• Pairwise Alignment

• ID Mapping

• Composition/Molecular Weight Calculation

Page 13: PIR: Protein Information Resource

•Demo… Demo… Demo… Demo… Demo…

Page 14: PIR: Protein Information Resource

Pros & Cons• Powerful

Provide many DBs and Tools

• Convenient

A Web Based Retrieval System

• PIRSF family classification system

Based on Evolutionary Relationships of Full-length Proteins

• Weak in Supporting Visualization of Data

Page 15: PIR: Protein Information Resource

Reference• http://pir.georgetown.edu/pirwww/about/brochure.pdf

• Wu CH, et al. The Protein Information Resource: an integrated public resource of functional annotation of proteins. Nucleic Acids Research, 30: 35-37, 2002.

• Huang H, et al.The PIR integrated protein databases and data retrieval system. Data Science 3: 163-174, 2004.