Biological Databases [S2] - ENRIQUE BLANCO 2013 eblanco@ub.edu
1. COMPUTER ANALYSIS
>sequence
TACGTACGTAGCTAGCTAGCTACGTAGCTAGCTAGCTACGTAGCTAATGTCGAAGTAACGTACGATCGTAGCTAGCTAGCTGATGCTATCGTAGCTAGCTGATGCATGCGCTAAACACATCGCTTTGGCACGAGCTAGCTAGCTACTACAGCACGGGGGCACGTAGTGCAGCTAGCAGCCGCCGCATCGCCCCCCGATCGATCGTAGCCGACGATCTACTACGTAGCGACTGACTGATCGATGAGGATCGTGAGCTAGCGTGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCAGCTACGTACGTAGCTAGCTACGAGCAGCTAGCTAGCTACGAC
+ computer analysis =
GENE1 FEATURE1 FEATURE2 … FEATUREm
GENE2 FEATURE1 FEATURE2 … FEATUREm
…
GENEn FEATURE1 FEATURE2 … FEATUREm
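The step pictured above, turning a raw sequence into one table row of features, can be sketched in Python. This is only an illustration: the slides do not say which features are computed, so sequence length and GC fraction are assumptions, and the function names `parse_fasta` and `sequence_features` are hypothetical.

```python
def parse_fasta(text):
    """Parse FASTA text into a dict {name: sequence}."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Header line: everything after '>' is taken as the record name.
            name = line[1:].strip()
            records[name] = []
        elif name is not None:
            # Sequence line: may be wrapped over several lines.
            records[name].append(line.upper())
    return {n: "".join(parts) for n, parts in records.items()}


def sequence_features(name, seq):
    """Return one table row: (name, length, GC fraction).

    The choice of features is an assumption made for this example.
    """
    gc = seq.count("G") + seq.count("C")
    gc_fraction = gc / len(seq) if seq else 0.0
    return (name, len(seq), round(gc_fraction, 3))


# Short made-up sequence, wrapped over two lines as FASTA allows.
fasta_text = """>sequence
TACGTACGTAGC
TAGCTAGC
"""

rows = [sequence_features(n, s) for n, s in parse_fasta(fasta_text).items()]
for row in rows:
    print("\t".join(str(field) for field in row))
```

Each input sequence yields one tab-separated line, which is the GENE / FEATURE layout used in the rest of the slides.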
2. ONE DATA SET
GENE1 FEATURE1 FEATURE2 … FEATUREm
GENE2 FEATURE1 FEATURE2 … FEATUREm
…
GENEn FEATURE1 FEATURE2 … FEATUREm

Operations on one data set:
SORT
REARRANGE
ADD/CONVERT/EXTRACT (new column FEATUREm+1)
FILTER
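The four operations named on this slide can be sketched on a small in-memory table. The gene names, the numeric values, and the meaning given to each column (a length and a fraction) are made up for the example.

```python
# Toy table with rows (gene, feature1, feature2). All values are illustrative.
table = [
    ("GENE1", 1200, 0.41),
    ("GENE2", 300, 0.62),
    ("GENE3", 850, 0.55),
]

# SORT: order rows by feature1, largest first.
sorted_rows = sorted(table, key=lambda row: row[1], reverse=True)

# REARRANGE: change the column order to (feature2, gene, feature1).
rearranged = [(f2, gene, f1) for gene, f1, f2 in table]

# ADD/CONVERT/EXTRACT: derive a new column (FEATUREm+1) from an existing one,
# here feature1 divided by 1000.
with_new_column = [(gene, f1, f2, f1 / 1000) for gene, f1, f2 in table]

# FILTER: keep only the rows that satisfy a condition on feature2.
filtered = [row for row in table if row[2] > 0.5]

for name, result in [
    ("SORT", sorted_rows),
    ("REARRANGE", rearranged),
    ("ADD", with_new_column),
    ("FILTER", filtered),
]:
    print(name)
    for row in result:
        print("\t".join(str(field) for field in row))
```

None of the operations modifies `table` itself; each one produces a new list, so the steps can be applied in any order or combined.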
3. TWO DATA SETS
GENE1 FEATURE1 FEATURE2
GENE2 FEATURE1 FEATURE2
…
GENEm FEATURE1 FEATURE2

GENE1 FEATURE1 FEATURE3
GENE2 FEATURE1 FEATURE3
…
GENEn FEATURE1 FEATURE3
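Two data sets that share a gene identifier are typically combined by matching rows on that identifier. The slide does not state which operation it has in mind, so the sketch below assumes an inner join on the gene name, plus a set difference; all values are invented.

```python
# First data set: gene -> (feature1, feature2). Values are illustrative only.
set_a = {
    "GENE1": ("chr1", 1200),
    "GENE2": ("chr2", 300),
    "GENE3": ("chr5", 850),
}

# Second data set: gene -> (feature1, feature3).
set_b = {
    "GENE1": ("chr1", 7.5),
    "GENE3": ("chr5", 0.2),
    "GENE4": ("chrX", 3.1),
}

# Inner join: genes present in both sets, one row per gene with
# (gene, feature1, feature2, feature3). feature1 is taken from the first set.
joined = [
    (gene, set_a[gene][0], set_a[gene][1], set_b[gene][1])
    for gene in sorted(set_a.keys() & set_b.keys())
]

# Difference: genes that appear only in the first set.
only_in_a = sorted(set_a.keys() - set_b.keys())

for row in joined:
    print("\t".join(str(field) for field in row))
print("only in first set:", ", ".join(only_in_a))
```

Using dictionaries keyed by gene makes each lookup constant time, so the two sets do not need to be sorted beforehand.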
4. PIPELINES/WORKFLOWS
>chromosome1
TACGTACGTAGCTAGCTAGCTACGTAGCTAGCTAGCTACGTAGCTAATGTCGAAGTAACGTACGATCGTAGCTAGCTAGCTGATGCTATCGTAGCTAGCTGATGCATGCGCTAAACACATCGCTTTGGCACGAGCTAGCTAGCTACTACAGCACGGGGGCACGTAGTGCAGCTAGCAGCCGCCGCATCGCCCCCCGATCGATCGTAGCCGACGATCTACTACGTAGCGACTGACTGATCGATGAGGATCGTGAGCTAGCGTGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCAGCTACGTACGTAGCTAGCTACGAGCAGCTAGCTAGCTACGAC
>chromosome22
TACGTACGTAGCTAGCTAGCTACGTAGCTAGCTAGCTACGTAGCTAATGTCGAAGTAACGTACGATCGTAGCTAGCTAGCTGATGCTATCGTAGCTAGCTGATGCATGCGCTAAACACATCGCTTTGGCACGAGCTAGCTAGCTACTACAGCACGGGGGCACGTAGTGCAGCTAGCAGCCGCCGCATCGCCCCCCGATCGATCGTAGCCGACGATCTACTACGTAGCGACTGACTGATCGATGAGGATCGTGAGCTAGCGTGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCAGCTACGTACGTAGCTAGCTACGAGCAGCTAGCTAGCTACGAC
…
>sequence
TACGTACGTAGCTAGCTAGCTACGTAGCTAGCTAGCTACGTAGCTAATGTCGAAGTAACGTACGATCGTAGCTAGCTAGCTGATGCTATCGTAGCTAGCTGATGCATGCGCTAAACACATCGCTTTGGCACGAGCTAGCTAGCTACTACAGCACGGGGGCACGTAGTGCAGCTAGCAGCCGCCGCATCGCCCCCCGATCGATCGTAGCCGACGATCTACTACGTAGCGACTGACTGATCGATGAGGATCGTGAGCTAGCGTGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCAGCTACGTACGTAGCTAGCTACGAGCAGCTAGCTAGCTACGAC
+
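A pipeline chains several small steps so that the output of one becomes the input of the next. The slide shows a set of chromosomes combined with a query sequence but does not say what is computed, so the sketch below assumes the simplest case: an exact-match search of the query in every chromosome, followed by a count per chromosome. The function names and the toy sequences are hypothetical.

```python
def read_fasta(text):
    """Step 1: yield (name, sequence) pairs from FASTA text."""
    name, parts = None, []
    for line in text.splitlines():
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(parts)
            name, parts = line[1:].strip(), []
        elif line and name is not None:
            parts.append(line.upper())
    if name is not None:
        yield name, "".join(parts)


def find_hits(records, query):
    """Step 2: yield (name, position) for every exact match (0-based).

    Overlapping matches are reported. Exact matching is an assumption;
    real read mapping normally tolerates mismatches.
    """
    for name, seq in records:
        start = seq.find(query)
        while start != -1:
            yield name, start
            start = seq.find(query, start + 1)


def count_hits(hits):
    """Step 3: summarise hits as rows (name, number_of_hits), most hits first."""
    counts = {}
    for name, _position in hits:
        counts[name] = counts.get(name, 0) + 1
    return sorted(counts.items(), key=lambda item: item[1], reverse=True)


# Toy genome with two short, made-up chromosomes.
genome = """>chromosome1
TACGTACGTAGC
>chromosome22
GGGTACGAAA
"""

# The pipeline: each step consumes the output of the previous one.
table = count_hits(find_hits(read_fasta(genome), "TACG"))
for name, n_hits in table:
    print(f"{name}\t{n_hits}")
```

Because the first two steps are generators, records and hits are processed one at a time rather than loaded all at once, which matters when the input is a whole genome.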