
Biology protein structure in cloud computing


Page 1: Biology protein structure in cloud computing

Biology: Protein Structure Prediction

What is Protein

Proteins are large molecules consisting of amino acids, which our bodies and the cells in our bodies need to function properly.

The structure, function, and regulation of the body's cells, tissues, and organs cannot exist without proteins.

Our muscles, skin, bones, and many other parts of the body contain significant amounts of protein. Protein accounts for 20% of total body weight.

Page 2: Biology protein structure in cloud computing

Why we need Protein

Your hair, your nails, and the outer layers of your skin are made of the protein keratin. Keratin is a scleroprotein, or a protein resistant to digestive enzymes. So if you bite your nails, you can’t digest them.

Bone has plenty of protein. The outer part of bone is hardened with minerals such as calcium but the basic, rubbery inner structure is protein, and bone marrow, the soft material inside the bone, also contains protein.

Red blood cells contain hemoglobin, a protein compound that carries oxygen throughout the body. Plasma, the clear fluid in blood, contains fat and protein particles known as lipoproteins, which ferry cholesterol around and out of the body.

Finally, proteins play an important part in the creation of every new cell and every new individual. Your chromosomes consist of nucleoproteins, which are substances made of amino acids and nucleic acids.

Page 3: Biology protein structure in cloud computing


Structure of protein

Page 4: Biology protein structure in cloud computing

Applications or software that require high computing capabilities and work with large data sets can generate heavy I/O load.

Because of these requirements, they make heavy use of supercomputing and cluster computing infrastructures.

Protein structure prediction is a computationally intensive task that is fundamental to many types of research in the life sciences.

Predicting protein structure helps medical scientists develop new drugs.

Page 5: Biology protein structure in cloud computing


This task requires investigating the protein structure across a very large number of states, and each of these states demands a large number of computations.

The computational power required for this prediction can now be acquired on demand, without owning it.

Cloud computing grants access to such capacity on a pay-per-use basis.

Page 6: Biology protein structure in cloud computing

A project that analyzes the use of cloud technologies for protein structure prediction is the Jeeva portal.

It is an integrated web portal that enables scientists to perform the prediction task using cloud technologies.

The prediction task uses machine learning techniques (support vector machines, SVMs) to determine the secondary structure of proteins.

These techniques recast the problem so that it can be divided into three phases: initialization, classification, and a final phase.

Page 7: Biology protein structure in cloud computing

As its name makes clear, initialization is the first phase of the prediction.

The actual prediction starts in the initialization phase.

In the second phase, classification, the execution is carried out concurrently.

This reduces the computational time.

The prediction algorithm is then transformed into a task graph, which is submitted to Aneka.
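The three-phase structure described above can be sketched as a small task graph in which only the middle phase runs concurrently. This is a minimal illustration, not the Jeeva or Aneka implementation: the function names, the sliding-window encoding, the toy labelling rule, and the labels "H" and "C" are all assumptions made for the example, and a local thread pool stands in for the cloud middleware.

```python
from concurrent.futures import ThreadPoolExecutor


def initialize(sequence, window=3):
    """Phase 1 (hypothetical): cut the sequence into overlapping windows."""
    return [sequence[i:i + window] for i in range(len(sequence) - window + 1)]


def classify(window):
    """Phase 2 (hypothetical stand-in for one trained classifier).

    Toy rule, NOT a real predictor: windows with two or more 'A'
    residues get the placeholder label 'H', all others get 'C'.
    """
    return "H" if window.count("A") >= 2 else "C"


def finalize(labels):
    """Phase 3 (hypothetical): merge the per-window labels into one string."""
    return "".join(labels)


def run_task_graph(sequence, max_workers=4):
    """Run the phases in sequence; the classification tasks are
    independent of each other, so they are executed concurrently."""
    windows = initialize(sequence)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns results in input order, so labels stay aligned
        # with their windows even though the tasks run in parallel.
        labels = list(pool.map(classify, windows))
    return finalize(labels)


if __name__ == "__main__":
    print(run_task_graph("AAGGG"))
```

The phases still depend on one another, so they run in order; the speed-up comes only from the independent tasks inside the classification phase.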

Page 8: Biology protein structure in cloud computing

Aneka is a platform and a framework for developing distributed applications on the cloud. It harnesses the spare CPU cycles of a heterogeneous network of desktop PCs and servers, or of data centers on demand.

Aneka provides developers with a rich set of APIs for transparently exploiting such resources and expressing the business logic of applications by using the preferred programming abstractions.

System administrators can leverage a collection of tools to monitor and control the deployed infrastructure. This infrastructure can be a public cloud available to anyone through the Internet, or a private cloud constituted by a set of nodes with restricted access.

Page 9: Biology protein structure in cloud computing


Page 10: Biology protein structure in cloud computing

Jeeva is a computational platform that simplifies the development of new prediction algorithms and improves their efficiency at the same time.

The Jeeva web portal system consists of an interactive web interface and Grid middleware.

With the interactive web interface, users can submit prediction requests for protein secondary structures, collect results, and manage the history of prediction data.

By means of the Grid middleware, researchers can not only deploy their prediction applications in a distributed environment easily, but also monitor and manage the execution in the distributed environment.

Page 11: Biology protein structure in cloud computing

Biology: Gene-Expression Data Analysis for Cancer Diagnosis

What is Cancer

The disease caused by an uncontrolled division of abnormal cells in a part of the body.

A malignant growth or tumor resulting from such a division of cells.

Cancer is a term used for diseases in which abnormal cells divide without control and are able to invade other tissues. Cancer cells can spread to other parts of the body through the blood and lymph systems.

Page 12: Biology protein structure in cloud computing


Cancer types can be grouped into broader categories. The main categories of cancer include:

Carcinoma - cancer that begins in the skin or in tissues that line or cover internal organs. There are a number of subtypes of carcinoma, including adenocarcinoma, basal cell carcinoma, squamous cell carcinoma, and transitional cell carcinoma.

Sarcoma - cancer that begins in bone, cartilage, fat, muscle, blood vessels, or other connective or supportive tissue.

Leukemia - cancer that starts in blood-forming tissue such as the bone marrow and causes large numbers of abnormal blood cells to be produced and enter the blood.

Lymphoma and myeloma - cancers that begin in the cells of the immune system.

Central nervous system cancers - cancers that begin in the tissues of the brain and spinal cord.

Page 13: Biology protein structure in cloud computing


Division of Cancer and Non-Cancer Cells

Page 14: Biology protein structure in cloud computing


Gene expression analysis is the process of analyzing hundreds or thousands of genes at a time.

The main application of gene expression analysis is cancer diagnosis and treatment.

Cancer occurs due to the uncontrolled growth and division of cells.

This happens because the genes that regulate cell growth have mutated.

All cancerous cells contain mutated genes.

Page 15: Biology protein structure in cloud computing

This problem can be addressed using classifiers or other algorithms.

One such classifier is the eXtended Classifier System (XCS).

It has been used to classify large data sets in the bioinformatics and computer science domains.

A variation of XCS, CoXCS, has proved effective in these conditions.

CoXCS divides the entire search space into subdomains, and these subdomains can be solved concurrently.

Page 16: Biology protein structure in cloud computing

Cloud-CoXCS is a cloud-based implementation of CoXCS that leverages Aneka to solve the subdomain problems and compose their outcomes.
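The pattern described here (split the search space into subdomains, solve each one concurrently, then compose the outcomes) can be sketched as follows. This is only an illustration of the partition/solve/compose structure, not XCS or CoXCS itself: the per-subdomain solver is a deliberately simple stand-in that picks the gene whose mean expression differs most between the two classes, all function names are hypothetical, and a local thread pool stands in for Aneka.

```python
from concurrent.futures import ThreadPoolExecutor
from statistics import mean


def partition(gene_ids, n_parts):
    """Split the gene indices into n_parts interleaved subdomains."""
    return [gene_ids[i::n_parts] for i in range(n_parts)]


def solve_subdomain(gene_ids, samples, labels):
    """Hypothetical stand-in for one classifier run on a subdomain.

    Returns (gene, score) for the gene whose mean expression differs
    most between class 1 and class 0. Assumes both classes are present.
    """
    def score(gene):
        pos = [s[gene] for s, y in zip(samples, labels) if y == 1]
        neg = [s[gene] for s, y in zip(samples, labels) if y == 0]
        return abs(mean(pos) - mean(neg))

    best = max(gene_ids, key=score)
    return best, score(best)


def compose(results):
    """Combine the per-subdomain outcomes by keeping the best-scoring one."""
    return max(results, key=lambda result: result[1])


def run(samples, labels, n_parts=2):
    gene_ids = list(range(len(samples[0])))
    # Drop empty subdomains, which occur when n_parts exceeds the gene count.
    parts = [p for p in partition(gene_ids, n_parts) if p]
    with ThreadPoolExecutor(max_workers=len(parts)) as pool:
        results = list(
            pool.map(lambda p: solve_subdomain(p, samples, labels), parts)
        )
    return compose(results)


if __name__ == "__main__":
    # Made-up toy data: 4 samples, 3 genes, labels 1 and 0.
    samples = [[1.0, 5.0, 2.0], [1.0, 6.0, 2.0], [1.0, 1.0, 2.5], [1.0, 0.0, 2.5]]
    labels = [1, 1, 0, 0]
    print(run(samples, labels))
```

Because the subdomains are independent, the number of workers can follow the number of subdomains, which is the property a cloud back end can exploit.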

Because of the dynamic nature of XCS, the number of computing resources required to execute it may change over time.