Cloud Computing and Digital Libraries
First perspectives on a future technological alliance
Pedro Lopes | [email protected] | PDEI - Seminars | Jan 23, 2009
Outline
‣ Internet
‣ Cloud-Computing
‣ Digital Libraries Issues
• Services
• Digital Data Storage
‣ Cloud Libraries
[http://infreemation.net/wp-content/uploads/cloud-computing-kitchen-sink.jpg]
‣ Internet has changed, it is “the” platform for everything
• Accessible
• Anywhere
• Anytime
• Everyone
• Applications run in the browser
• Entertainment: Hi5, Facebook, IMDb...
• Productivity: Office Live, Google Apps...
• “Everything”-as-a-service
• Internet is the Cloud
Cloud Computing [Motivation]
How do novel applications work within the cloud?
‣ GRID > Virtualization > Cloud
Cloud Computing [Architecture]
[http://www.slideshare.net/ruv/meshu-cloud-camp]
Cloud Computing [Benefits]
‣ Resource flexibility
• Improve service level and availability
‣ Pay for what you use
• Cost saving, leveraging economies of scale
‣ Rapid prototyping and market testing
• Increased speed to market
‣ Self-service deployment
‣ Reduce lock-in and switching costs
Digital Libraries
‣ Store information in the digital world
• Digital knowledge
‣Make it available online
• With some restrictions
‣ Issues
• Lack of services / functionalities
• Difficulties digital data storage
Data Storage & Infrastructures
Get the data IN
Get the data OUT
Services [Problem]
‣ There’s an enormous amount of data
• But there’s a lack in advanced functionalities
‣What can we do with the data?
• Search
• View
‣What new services?
• Information mining
• Advanced queries
• Semantics
Services [Solution]
‣ Services can be easily created in the correct frameworks
‣ Cloud-Computing
• Built-in framework
• Easy development of new services
• Increases momentum in the developer community
• Faster testing and deployment
• More processing power
Digital Data Storage [Problem]
‣ Digital data
• Preservation
• Component failures
• Obsolescence
• Attacks
• Natural disasters
• Distribution
• Availability
• Heterogeneity
• Scalability
• Authenticity
Digital Data Storage [Solution]
‣ Preservation issues may be solved with distributed environments
‣ Cloud-computing
• Built-in architecture
• Distributed
• Every single data bit is replicated
• Metadata
‣ Geographic distribution
‣ Scalability
• Better availability is assured
Cloud Libraries [Conclusions]
‣ Information is the most important component
‣ “We shouldn’t care about anything else”
• Leave that to the cloud-computing environment
• Service development
• Distributed architecture
• Metadata
‣ Digital libraries and cloud-computing are a perfect match
• Create a digital library based on a cloud computing operating environment
• Microsoft Azure Services
• Amazon Elastic Compute Cloud
• Google App Engine
Questions?
Thank You