Estudiar para el examen Flashcards

(39 cards)

1
Q

is a model for enabling
ubiquitous, convenient, on-demand network
access to a shared pool of configurable
computing resources (e.g., networks, servers,
storage, applications, and services) that can be
rapidly provisioned and released with minimal
management effort or service provider
interaction

A

Cloud Computing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Que servicios da cloud computing?

A

Software-as-a-Service (SaaS)
Platform-as-a-Service (PaaS)
Infrastructure-as-a-Service (IaaS)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Applications are accessible from several client devices
The provider is responsible for the application
Examples, SalesForce.com, NetSuit, Google, IBM, etc.

A

SaaS (Software as a Service)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The client is responsible for the end-to-end life cycle in
terms of developing, testing and deploying applications
Providers supplies all the systems
Examples are Google’s appEngine, Microsoft´s Azure, etc.

A

Paas (Platform as a service)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

The service client has control over the operating
system, storage, and applications which are offered
through a Web-based access point
In this type of service the client manages the
storing and development environments for Cloud
Computing application such as the Hadoop
Distributed File System (HDFS) and the MapReduce
development framework.
Examples of infrastructure providers are GoGird,
AppNexeus, Eucalyptus, Amazon EC2, etc.

A

IaaS (Infrastrucutre as a Service)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

refers to an
eclectic and increasingly familiar group of nonrelational data management systems; where
databases are not built primarily on tables, and
generally do not use SQL for data manipulation.

A

NoSQL?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Que significan las siglas NoSQL?

A

not Only SQL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

De que manera esta diseñado NoSQL?

A

non-relational
databases designed for large-scale data storage
and for massively-parallel data processing
across a large number of commodity servers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

En que se enfoca las bases de datos NoSQL?

A

focus on
analytical processing of large scale datasets,
offering increased scalability over commodity
hardware

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

only two of
the following three different aspects of scaling
out can be achieved fully at the same time

A

teorema CAP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

all clients see the same version
of the data, even on updates to the dataset

A

Strong Consistency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

all clients can always find at least
one copy of the requested data, even if some of the
machines in a cluster is down

A

High Availability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

the total system keeps its
characteristic even when being deployed on
different servers, transparent to the client

A

Partition-tolerance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

cuales son los 4 tipos de BD en las que se clasifica NoSQL?

A

Key-Value stores
Document databases (or stores)
WideColumn (or Column-Family) stores
Graph
databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

these Data
Management Systems (DMS) store items as alphanumeric identifiers (keys) and associated values in
simple, standalone tables (referred to as ―hash
tables‖). The values may be simple text strings or
more complex lists and sets.

A

Key-Value Stores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Ejemplos de Key Value Stores

A

Dynamo (Amazon);
Voldemort (LinkedIn); Redis; BerkeleyDB; Riak

17
Q

are designed to manage
and store documents which are encoded in a
standard data exchange format such as XML,
JSON or BSON

A

Document Databases

18
Q

This type of
NoSQL Database employs a distributed,
column-oriented data structure that
accommodates multiple attributes per key
These NoSQL Databases generally replicate not
just Google‘s Bigtable data storage structure,
but Google‘s distributed file system (GFS) and
MapReduce parallel processing framework as
well

A

wide-columns o column family

19
Q

replace
relational tables with structured relational
graphs of interconnected key-value pairings.
They are similar to object-oriented databases
as the graphs are represented as an objectoriented network of nodes

A

Graph Databases

20
Q

is a column-oriented
database management system that runs on top
of Hadoop Distributed File System (HDFS)

21
Q

en que se implemento Hbase?

A

HBase is an open-source implementation of the
Google BigTable architecture.

22
Q

HBAse provides consistent read
and write operations and thus can be used for
high speed requirements. This also helps to
increase the overall throughput of the system

23
Q

Atomic read and write
means that only one process can perform a
given task at a given time. For example when
one process is performing write operation no
other processes can perform the write
operation on that data.

A

Atomic Read and Write

24
Q

HBase offers automatic and manual
splitting of regions. This means that if a region
reaches its threshold size it automatially splits into
smaller sub regions.

24
HBase provides Local Area Network(LAN) and Wireless Area Network(WAN) which supports failure recovery. There is a master server which monitors all the regions and metadata of the cluster.
High Availability
24
HBase offers access through the Java API which helps to programmatically access HBase
Client API
25
This is one of the important characteristics of non-relational databases. HBase supports scalability both in linear and modular form
Scalability
26
This feature of HBase helps usage of distributed storage such as HDFS
Distributed Storage
26
HBase can run on top of various systems such as Hadoop/HDFS
HDFS/Hadoop integration
27
The data in HBase are replicated over a number of clusters. This helps to recover data in case of any loss and high availability of data
Data Replication
28
HBase supports Java API which makes it easily available programmatically using java
API Support
29
HBase supports map reduce which helps in parallel processing of data
MapReduce Support
30
HBase uses keys and stores it in lexicographical order thus optimizing the requests
Sorted Row Keys
31
HBase performs real time processing of data and supports block cache and bloom filters
Real Time Processing of Data
32
Reduced I/O is one of the primary reasons for this layout type
Hadoop Hbase-architecture
33
que conforma la estructura de las base de datos Hbase?
tables are made of rows and columns. All columns in HBase belong to a particular column family.
34
the intersection of row and column coordinates -- are versioned. A cell’s content is an uninterpreted array of bytes.
Table cells
35
In Hbase, table row keys are byte arrays so almost anything can serve as a row key from strings to binary representations of longs or even serialized data structures. Rows in HBase tables are sorted by row key.
row keys
36
* Tables are declared up front at schema definition time * Rows are lexicographically sorted with the lowest order appearing first in a table. * Columns are grouped into column families
aspectos a considerar al diseñar una BD HBase