Estudiar para el examen Flashcards by Sandra Prieto

is a model for enabling
ubiquitous, convenient, on-demand network
access to a shared pool of configurable
computing resources (e.g., networks, servers,
storage, applications, and services) that can be
rapidly provisioned and released with minimal
management effort or service provider
interaction

Cloud Computing

How well did you know this?

Not at all

Perfectly

Que servicios da cloud computing?

Software-as-a-Service (SaaS)
Platform-as-a-Service (PaaS)
Infrastructure-as-a-Service (IaaS)

How well did you know this?

Not at all

Perfectly

Applications are accessible from several client devices
The provider is responsible for the application
Examples, SalesForce.com, NetSuit, Google, IBM, etc.

SaaS (Software as a Service)

How well did you know this?

Not at all

Perfectly

The client is responsible for the end-to-end life cycle in
terms of developing, testing and deploying applications
Providers supplies all the systems
Examples are Google’s appEngine, Microsoft´s Azure, etc.

Paas (Platform as a service)

How well did you know this?

Not at all

Perfectly

The service client has control over the operating
system, storage, and applications which are offered
through a Web-based access point
In this type of service the client manages the
storing and development environments for Cloud
Computing application such as the Hadoop
Distributed File System (HDFS) and the MapReduce
development framework.
Examples of infrastructure providers are GoGird,
AppNexeus, Eucalyptus, Amazon EC2, etc.

IaaS (Infrastrucutre as a Service)

How well did you know this?

Not at all

Perfectly

refers to an
eclectic and increasingly familiar group of nonrelational data management systems; where
databases are not built primarily on tables, and
generally do not use SQL for data manipulation.

NoSQL?

How well did you know this?

Not at all

Perfectly

Que significan las siglas NoSQL?

not Only SQL

How well did you know this?

Not at all

Perfectly

De que manera esta diseñado NoSQL?

non-relational
databases designed for large-scale data storage
and for massively-parallel data processing
across a large number of commodity servers

How well did you know this?

Not at all

Perfectly

En que se enfoca las bases de datos NoSQL?

focus on
analytical processing of large scale datasets,
offering increased scalability over commodity
hardware

How well did you know this?

Not at all

Perfectly

only two of
the following three different aspects of scaling
out can be achieved fully at the same time

teorema CAP

How well did you know this?

Not at all

Perfectly

all clients see the same version
of the data, even on updates to the dataset

Strong Consistency

How well did you know this?

Not at all

Perfectly

all clients can always find at least
one copy of the requested data, even if some of the
machines in a cluster is down

High Availability

How well did you know this?

Not at all

Perfectly

the total system keeps its
characteristic even when being deployed on
different servers, transparent to the client

Partition-tolerance

How well did you know this?

Not at all

Perfectly

cuales son los 4 tipos de BD en las que se clasifica NoSQL?

Key-Value stores
Document databases (or stores)
WideColumn (or Column-Family) stores
Graph
databases

How well did you know this?

Not at all

Perfectly

these Data
Management Systems (DMS) store items as alphanumeric identifiers (keys) and associated values in
simple, standalone tables (referred to as ―hash
tables‖). The values may be simple text strings or
more complex lists and sets.

Key-Value Stores

How well did you know this?

Not at all

Perfectly

Ejemplos de Key Value Stores

Study These Flashcards

Dynamo (Amazon);
Voldemort (LinkedIn); Redis; BerkeleyDB; Riak

are designed to manage
and store documents which are encoded in a
standard data exchange format such as XML,
JSON or BSON

Study These Flashcards

Document Databases

This type of
NoSQL Database employs a distributed,
column-oriented data structure that
accommodates multiple attributes per key
These NoSQL Databases generally replicate not
just Google‘s Bigtable data storage structure,
but Google‘s distributed file system (GFS) and
MapReduce parallel processing framework as
well

Study These Flashcards

wide-columns o column family

replace
relational tables with structured relational
graphs of interconnected key-value pairings.
They are similar to object-oriented databases
as the graphs are represented as an objectoriented network of nodes

Study These Flashcards

Graph Databases

is a column-oriented
database management system that runs on top
of Hadoop Distributed File System (HDFS)

Study These Flashcards

HBase

en que se implemento Hbase?

Study These Flashcards

HBase is an open-source implementation of the
Google BigTable architecture.

HBAse provides consistent read
and write operations and thus can be used for
high speed requirements. This also helps to
increase the overall throughput of the system

Study These Flashcards

Consistency

Atomic read and write
means that only one process can perform a
given task at a given time. For example when
one process is performing write operation no
other processes can perform the write
operation on that data.

Study These Flashcards

Atomic Read and Write

HBase offers automatic and manual
splitting of regions. This means that if a region
reaches its threshold size it automatially splits into
smaller sub regions.

Study These Flashcards

Sharding

HBase provides Local Area Network(LAN) and Wireless Area Network(WAN) which supports failure recovery. There is a master server which monitors all the regions and metadata of the cluster.

High Availability

HBase offers access through the Java API which helps to programmatically access HBase

Client API

This is one of the important characteristics of non-relational databases. HBase supports scalability both in linear and modular form

Scalability

This feature of HBase helps usage of distributed storage such as HDFS

Distributed Storage

HBase can run on top of various systems such as Hadoop/HDFS

HDFS/Hadoop integration

The data in HBase are replicated over a number of clusters. This helps to recover data in case of any loss and high availability of data

Data Replication

HBase supports Java API which makes it easily available programmatically using java

API Support

HBase supports map reduce which helps in parallel processing of data

MapReduce Support

HBase uses keys and stores it in lexicographical order thus optimizing the requests

Sorted Row Keys

HBase performs real time processing of data and supports block cache and bloom filters

Real Time Processing of Data

Reduced I/O is one of the primary reasons for this layout type

Hadoop Hbase-architecture

que conforma la estructura de las base de datos Hbase?

tables are made of rows and columns. All columns in HBase belong to a particular column family.

the intersection of row and column coordinates -- are versioned. A cell’s content is an uninterpreted array of bytes.

Table cells

In Hbase, table row keys are byte arrays so almost anything can serve as a row key from strings to binary representations of longs or even serialized data structures. Rows in HBase tables are sorted by row key.

row keys

* Tables are declared up front at schema definition time * Rows are lexicographically sorted with the lowest order appearing first in a table. * Columns are grouped into column families

aspectos a considerar al diseñar una BD HBase

Estudiar para el examen Flashcards

(39 cards)