06-10-2012, 04:27 PM
Overview of Cloud Technologies and Parallel Programming Frameworks for Scientific Applications
Moving towards..
Computing Clouds
Cloud Infrastructure Services
Cloud infrastructure software
Distributed File Systems
HDFS, etc.
Distributed Key-Value stores
Data intensive parallel application frameworks
MapReduce
High level languages
Science in the clouds
Virtualization
Goals
Server consolidation
Co-located hosting & on demand provisioning
Secure platforms (e.g., sandboxing)
Application mobility & server migration
Multiple execution environments
Saved images and appliances, etc.
Different virtualization techniques
User mode Linux
Pure virtualization (e.g., VMware)
Hard to do until processors gained virtualization extensions (hardware-assisted virtualization)
Paravirtualization (e.g., Xen)
Modified guest OSes
Programming language virtual machines
Cloud Computing
On demand computational services over web
Spiky compute needs of the scientists
Horizontal scaling with no additional cost
Increased throughput
Public Clouds
Amazon Web Services, Windows Azure, Google AppEngine, …
Private Cloud Infrastructure Software
Eucalyptus, Nimbus, OpenNebula
Cloud Infrastructure Software Stacks
Manage provisioning of virtual machines for a cloud providing infrastructure as a service
Coordinates many components
Hardware and OS
Network, DNS, DHCP
VMM Hypervisor
VM Image archives
User front end, etc.
Cloud Infrastructure Services
Storage, messaging, tabular storage
Cloud-oriented service guarantees
Distributed, highly scalable & highly available, low latency
Consistency tradeoffs
Virtually unlimited scalability
Minimal management / maintenance overhead
Google BigTable
Data Model
A sparse, distributed, persistent multidimensional sorted map
Indexed by a row key, column key, and a timestamp
A table contains column families
Column keys are grouped into column families
Row ranges are stored as tablets (sharding)
Supports single row transactions
Uses the Chubby distributed lock service to manage masters and tablet locks
Based on GFS
Supports running Sawzall scripts and MapReduce
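The data model bullets above (a sparse sorted map indexed by row key, column key, and timestamp, with row ranges split into tablets) can be sketched in a few lines of Python. This is only a toy in-memory illustration, not how BigTable is implemented: the class `SparseTable`, the helper `tablet_for`, the sample row and column names, and the split keys are all made up for this example.

```python
import bisect


class SparseTable:
    """Toy model of a sparse map keyed by (row, column, timestamp)."""

    def __init__(self):
        # (row_key, "family:qualifier") -> list of (timestamp, value), newest first.
        # Cells that were never written simply have no entry (sparse).
        self._cells = {}

    def put(self, row, column, timestamp, value):
        versions = self._cells.setdefault((row, column), [])
        versions.append((timestamp, value))
        versions.sort(key=lambda tv: tv[0], reverse=True)

    def get(self, row, column, timestamp=None):
        """Return the newest value at or before `timestamp` (latest if None)."""
        for ts, value in self._cells.get((row, column), []):
            if timestamp is None or ts <= timestamp:
                return value
        return None

    def scan(self, start_row, end_row):
        """Yield cells with start_row <= row < end_row in sorted key order."""
        for row, column in sorted(self._cells):
            if start_row <= row < end_row:
                for ts, value in self._cells[(row, column)]:
                    yield row, column, ts, value


def tablet_for(row, split_keys):
    """Index of the tablet (row range) that holds `row`.

    `split_keys` is a sorted list of hypothetical boundary row keys;
    tablet i covers rows from split_keys[i-1] up to, not including, split_keys[i].
    """
    return bisect.bisect_right(split_keys, row)


table = SparseTable()
table.put("com.cnn.www", "contents:", 1, "<html>v1</html>")
table.put("com.cnn.www", "contents:", 3, "<html>v3</html>")
latest = table.get("com.cnn.www", "contents:")
older = table.get("com.cnn.www", "contents:", timestamp=2)
tablet = tablet_for("com.cnn.www", ["com.g", "com.n"])
```

Keeping rows in sorted order is what makes range scans and range-based sharding cheap: locating the tablet for a row is a binary search over the split keys.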