Exam Study Flashcards
(39 cards)
is a model for enabling
ubiquitous, convenient, on-demand network
access to a shared pool of configurable
computing resources (e.g., networks, servers,
storage, applications, and services) that can be
rapidly provisioned and released with minimal
management effort or service provider
interaction
Cloud Computing
What services does cloud computing provide?
Software-as-a-Service (SaaS)
Platform-as-a-Service (PaaS)
Infrastructure-as-a-Service (IaaS)
Applications are accessible from several client devices
The provider is responsible for the application
Examples: Salesforce.com, NetSuite, Google, IBM, etc.
SaaS (Software as a Service)
The client is responsible for the end-to-end life cycle in
terms of developing, testing and deploying applications
The provider supplies all the systems
Examples are Google's App Engine, Microsoft's Azure, etc.
PaaS (Platform as a Service)
The service client has control over the operating
system, storage, and applications which are offered
through a Web-based access point
In this type of service the client manages the
storage and development environments for Cloud
Computing applications, such as the Hadoop
Distributed File System (HDFS) and the MapReduce
development framework.
Examples of infrastructure providers are GoGrid,
AppNexus, Eucalyptus, Amazon EC2, etc.
IaaS (Infrastructure as a Service)
refers to an
eclectic and increasingly familiar group of non-relational data management systems, where
databases are not built primarily on tables, and
generally do not use SQL for data manipulation.
NoSQL
What does the acronym NoSQL stand for?
Not Only SQL
How is NoSQL designed?
non-relational
databases designed for large-scale data storage
and for massively parallel data processing
across a large number of commodity servers
What do NoSQL databases focus on?
focus on
analytical processing of large-scale datasets,
offering increased scalability over commodity
hardware
only two of
the following three different aspects of scaling
out can be achieved fully at the same time
CAP theorem
all clients see the same version
of the data, even on updates to the dataset
Strong Consistency
all clients can always find at least
one copy of the requested data, even if some of the
machines in a cluster are down
High Availability
the total system keeps its
characteristics even when being deployed on
different servers, transparent to the client
Partition-tolerance
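Study aid (not from the original cards): a minimal sketch that enumerates the pairs the CAP theorem card says can be achieved fully at the same time. The short labels CA, CP and AP are common shorthand and are an assumption here, not terms taken from the cards.

```python
from itertools import combinations

# The three aspects named on the cards above.
ASPECTS = ["Consistency", "Availability", "Partition tolerance"]

# Every way of choosing two of the three aspects.
pairs = list(combinations(ASPECTS, 2))

# Shorthand label built from the first letter of each aspect (assumed naming).
labels = ["".join(name[0] for name in pair) for pair in pairs]

for label, pair in zip(labels, pairs):
    print(label, "=", " + ".join(pair))
```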
What are the 4 types of databases into which NoSQL is classified?
Key-Value stores
Document databases (or stores)
WideColumn (or Column-Family) stores
Graph databases
these Data
Management Systems (DMS) store items as alphanumeric identifiers (keys) and associated values in
simple, standalone tables (referred to as "hash
tables"). The values may be simple text strings or
more complex lists and sets.
Key-Value Stores
Examples of Key-Value Stores
Dynamo (Amazon);
Voldemort (LinkedIn); Redis; BerkeleyDB; Riak
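Study aid (not from the original cards): a minimal in-memory sketch of the key-value idea, where alphanumeric keys map to values that are plain strings, lists or sets. The `KeyValueStore` class and its method names are hypothetical and do not reflect the API of Dynamo, Voldemort, Redis, BerkeleyDB or Riak.

```python
class KeyValueStore:
    """Hypothetical toy store: one standalone hash table of key -> value."""

    def __init__(self):
        self._table = {}  # a Python dict is itself a hash table

    def put(self, key, value):
        self._table[key] = value

    def get(self, key, default=None):
        return self._table.get(key, default)

    def delete(self, key):
        # Removing a missing key is treated as a no-op.
        self._table.pop(key, None)


store = KeyValueStore()
store.put("user:1:name", "Ada")                  # simple text string
store.put("user:1:tags", ["admin", "dev"])       # list value
store.put("user:1:roles", {"reader", "writer"})  # set value
store.delete("user:1:tags")
```

Lookups go only through the key; there is no query language over the values, which is what keeps this model simple.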
are designed to manage
and store documents which are encoded in a
standard data exchange format such as XML,
JSON or BSON
Document Databases
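Study aid (not from the original cards): a minimal sketch of what "a document encoded in JSON" looks like, using Python's standard `json` module. The order document and its field names are invented for illustration.

```python
import json

# A self-contained document: nested fields and a list, no fixed table schema.
document = {
    "_id": "order-1001",
    "customer": {"name": "Ada", "city": "Lima"},
    "items": [
        {"sku": "A1", "qty": 2},
        {"sku": "B7", "qty": 1},
    ],
}

# Encode to the JSON exchange format, as a document store would persist it.
encoded = json.dumps(document)

# Decode it back and read a nested value directly from the document.
decoded = json.loads(encoded)
total_qty = sum(item["qty"] for item in decoded["items"])
```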
This type of
NoSQL Database employs a distributed,
column-oriented data structure that
accommodates multiple attributes per key
These NoSQL Databases generally replicate not
just Google's Bigtable data storage structure,
but Google's distributed file system (GFS) and
MapReduce parallel processing framework as
well
Wide-Column or Column-Family stores
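Study aid (not from the original cards): a minimal sketch of "multiple attributes per key", modelled as row key -> column family -> column qualifier -> value. The `WideColumnTable` class is hypothetical and only imitates the shape of the data, not the storage engine of Bigtable or any real product.

```python
class WideColumnTable:
    """Hypothetical toy table: rows hold column families, families hold columns."""

    def __init__(self, families):
        self.families = set(families)  # families are declared up front
        self.rows = {}                 # row key -> family -> qualifier -> value

    def put(self, row_key, family, qualifier, value):
        if family not in self.families:
            raise KeyError(f"unknown column family: {family}")
        self.rows.setdefault(row_key, {}).setdefault(family, {})[qualifier] = value

    def get_row(self, row_key):
        return self.rows.get(row_key, {})


table = WideColumnTable(["info", "stats"])
table.put("row1", "info", "name", "Ada")
table.put("row1", "info", "city", "Lima")
table.put("row1", "stats", "visits", 3)
table.put("row2", "info", "name", "Bob")  # rows need not share the same columns
```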
replace
relational tables with structured relational
graphs of interconnected key-value pairings.
They are similar to object-oriented databases
as the graphs are represented as an object-oriented network of nodes
Graph Databases
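Study aid (not from the original cards): a minimal sketch of a network of nodes that carry key-value properties and are linked by labelled edges. The `Graph` class, the node names and the edge labels are all hypothetical.

```python
from collections import deque


class Graph:
    """Hypothetical toy graph: nodes with properties, directed labelled edges."""

    def __init__(self):
        self.nodes = {}  # node id -> dict of key-value properties
        self.edges = {}  # node id -> list of (label, target id)

    def add_node(self, node_id, **properties):
        self.nodes[node_id] = properties
        self.edges.setdefault(node_id, [])

    def add_edge(self, source, label, target):
        self.edges[source].append((label, target))

    def reachable(self, start):
        # Breadth-first traversal following outgoing edges from `start`.
        seen = [start]
        queue = deque([start])
        while queue:
            current = queue.popleft()
            for _label, target in self.edges[current]:
                if target not in seen:
                    seen.append(target)
                    queue.append(target)
        return seen


g = Graph()
g.add_node("ana", kind="person")
g.add_node("bob", kind="person")
g.add_node("hbase", kind="project")
g.add_edge("ana", "KNOWS", "bob")
g.add_edge("bob", "WORKS_ON", "hbase")
```

Questions are answered by walking edges (for example `g.reachable("ana")`) rather than by joining tables.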
is a column-oriented
database management system that runs on top
of the Hadoop Distributed File System (HDFS)
HBase
What is HBase an implementation of?
HBase is an open-source implementation of the
Google BigTable architecture.
HBase provides consistent read
and write operations and thus can be used for
high-speed requirements. This also helps to
increase the overall throughput of the system
Consistency
Atomic read and write
means that only one process can perform a
given task at a given time. For example, when
one process is performing a write operation, no
other process can perform a write
operation on that data.
Atomic Read and Write
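Study aid (not from the original cards): a minimal sketch of the "only one process at a time" idea, using a lock so that concurrent writers cannot interleave on the same piece of data. This is a generic mutual-exclusion example with Python threads; it is an assumption for illustration and not how HBase implements atomicity internally.

```python
import threading


class AtomicCell:
    """Hypothetical cell whose reads and writes are serialized by one lock."""

    def __init__(self, value=0):
        self._value = value
        self._lock = threading.Lock()

    def read(self):
        with self._lock:
            return self._value

    def update(self, fn):
        # The whole read-modify-write happens while holding the lock,
        # so no other thread can write in between.
        with self._lock:
            self._value = fn(self._value)
            return self._value


cell = AtomicCell(0)


def worker():
    for _ in range(1000):
        cell.update(lambda v: v + 1)


threads = [threading.Thread(target=worker) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

Because every increment runs under the lock, no update is lost: four workers doing 1000 increments each leave the cell at 4000.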
HBase offers automatic and manual
splitting of regions. This means that if a region
reaches its threshold size it automatically splits into
smaller subregions.
Sharding
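Study aid (not from the original cards): a minimal sketch of threshold-based automatic splitting. Each region is a sorted list of row keys, and a region that grows past a size limit is cut in half. The `Table` class, the key-count threshold and the midpoint split are simplifying assumptions; real HBase split policies are configurable and size-based.

```python
import bisect


class Table:
    """Hypothetical toy table made of regions, each a sorted list of row keys."""

    def __init__(self, max_keys_per_region):
        self.max_keys = max_keys_per_region
        self.regions = [[]]  # start with one empty region

    def _find_region(self, key):
        # Pick the last region whose first key is <= key (region 0 otherwise).
        idx = 0
        for i, region in enumerate(self.regions):
            if region and region[0] <= key:
                idx = i
        return idx

    def put(self, key):
        idx = self._find_region(key)
        region = self.regions[idx]
        bisect.insort(region, key)  # keep keys sorted inside the region
        if len(region) > self.max_keys:
            # Threshold exceeded: split into two smaller regions at the midpoint.
            mid = len(region) // 2
            self.regions[idx:idx + 1] = [region[:mid], region[mid:]]


table = Table(max_keys_per_region=4)
for key in "abcdefg":
    table.put(key)
```

After the seven inserts the single starting region has split twice, leaving three regions that together still cover every key in sorted order.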