Computer Science and Engineering - Tutorials, Notes, MCQs, Questions and Answers: Keywords and Definitions

Showing posts with label Keywords and Definitions. Show all posts

Wednesday, February 4, 2015

Keywords and Definitions in Distributed Database

Keywords in Distributed Database / Keywords defined in Distributed Database / Two Marks Questions with Answers in Distributed Database

Keywords and Definitions

Replication (in Database)

Storing identical copies of same relations (tables) in different sites in a distributed database is called Replication.

For example, a relation R is sent to several sites as R. At the end of replication in n different sites, the following will hold;

R in Site 1 = R in Site 2 = R in Site 3 = ... = R in Site n

Fragmentation (in Database)

Partitioning a relation (table) vertically or horizontally into several partitions and storing each partition in different site is called Fragmentation in distributed database.

For example, a relation R is horizontally fragmented into n fragments. At the end of Horizontal fragmentation, every fragment is sent to n sites as follows;

R = R1 U R2 U R3 U ... U Rn

Or, a relation R is vertically fragmented into n fragments, and sent to n sites as follows;

R = R1

\bowtie

\bowtie

\bowtie

...

\bowtie

In-doubt Transaction

When a site recovers from failure, the first job is to check its own log file to set right all those transactions that were executing during the failure. In this event, the recovering site checks for the transaction control messages like <commit T>, <abort T>, or <ready T>.

In this process, if any transaction have only <ready T> messages and no <commit T> or <abort T>, then the recovering site cannot decide the fate of that particular transaction on its own. It means, the site has to contact other sites for deciding the fate of the transaction. Such a transaction is called as In-doubt transaction.

The Purpose of Commit Protocol in Distributed Database

To ensure Atomicity. That is, a distributed transaction would mean a single transaction which is executed at various sites simultaneously. At the end of the execution of that transaction, all sites must perform a commit or abort without any conflicts. Failing which would lead to an inconsistent database.

***********

Go to Distributed Database home

Thursday, August 21, 2014

Some Keywords and Definitions in DBMS

Useful keywords and definitions in DBMS / Two Mark definitions in DBMS / Need to know definitions in DBMS

1NF (First Normal Form)

2NF (Second Normal Form)

3NF (Third Normal Form)

A

Access time

ACID

Arity

Atomic Attribute

Atomicity (ACID property)

Attribute

Attribute Domain

B

Buffer Cache

Buffer manager

C

Candidate key

Cardinality (in SQL)

Cardinality Ratios

Cascadeless schedule

Composite Attribute

Conceptual Schema

Consistency (ACID property)

Correlated sub-query

D

Data Abstraction

Database Languages

Data Control Language (DCL)

Data Definition Language (DDL)

Data Dictionary

Data Manipulation Language (DML)

Data Redundancy

Data Transfer Rate

DBA Functions

Degree of a relation (table)

Degree of a relationship

Derived Attribute

Descriptive Attribute

Durability (ACID property)

E

Entity Set

Entity integrity

F

Foreign Key

Functional Dependency

Functions of Database Administrator

Full Functional Dependency

G

H

I

Instance

Isolation (ACID property)

J

K

L

Logical Data Independence

Logical Schema

Lost Update problem (concurrency control)

M

Many-to-Many Relationship

Many-to-One Relationship

Mapping Cardinality

Multi-Valued Attribute

N

Non-Prime Attribute (Non-Key attribute)

Normalization

O

One-to-Many Relationship

One-to-One Relationship

P

Partial Functional Dependency

Partial Participation

Participation

Physical Data Independence

Physical Schema

Primary Key

Prime Attribute

Q

R

Recoverable schedule

Referential integrity

Rotational latency

S

Schema

Seek time

Selectivity

Simple Attribute

Single-Valued Attribute

Strong Entity Set

Super Key

T

Total Participation

Trivial Functional Dependency

Type of relationship

U

V

W

Weak Entity Set

X

Y

Z

*********

Go to Database Management Systems home

important keywords in RDBMS

define concepts in database management systems

two mark questions in RDBMS

database management systems glossary

Encyclopedia of relational database management systems

Monday, February 24, 2014

Some Keywords in Parallel Database Systems

Some Keywords in Parallel Database Management System

Coarse Grain Parallel Machine

It is a parallel machine architecture in which parallelism is achieved by connecting small number of powerful processors. Such a system is used for executing more queries/transactions in parallel (in case of Interquery Parallelism).

Degree of Parallelism

The number of resources (say processors) need to do certain things in parallel without affecting (degrading) the performance of the parallel system.

Fine Grain Parallel Machine

It is a parallel machine architecture in which parallelism is achieved by connecting more number of simple processors. Such a system is used for parallelizing and executing a single query in parallel (in case of Intraquery Parallelism). Typically, scientific queries or more complex queries are executed this way.

Hash Partitioning

In parallel database, Hash partitioning is one of the data partitioning techniques (others, Round-robin and Range partitioning) primarily used for distributing tuples (records) among the chosen disks using a hash function. It works by using a hash function which is defined on an attribute or set of attributes on which partitions need to be made. Hash function should be designed to distribute the data into n disks which is finite.

Interference

Newly generated requests must compete with ongoing processes for sharing the resources like system bus, disks, or locks that are held by the ongoing processes. This activity might cause slow down in the overall processes. This kind of intervention is called interference.

Linear Speed-up

If the speedup of a parallel database system is N when N times larger system used for processing the requests handled by a smaller system is called Linear Speed-up.

Pipelined parallelism

It is a form of parallelism where the output of one processor is consumed as the input of other processor immediately after the first record of the result is generated.

Range Partitioning

In parallel database, Range partitioning is one of the data partitioning techniques (others, Hash and Round-robin partitioning) primarily used for distributing tuples (records) among the chosen disks using set of ranges. It works by choosing a range vector. The elements of range vector are the values of the partitioning attribute. Range vectors are chosen based on the number of disks available, which is finite.

Response Time

The amount of time consumed by a single machine (parallel machines in the case of parallel database) to finish a job since it is submitted for execution. This is one of the two performance measures of a database management system (the other one is Throughput).

Round-robin Partitioning

In parallel database, Round-robin partitioning is one of the data partitioning techniques (others, Hash and Range partitioning) primarily used for distributing tuples (records) among the chosen disks. It works by distributing the records of a table into n different disks in round-robin fashion like first record to first disk, second record to second disk and so on, iteratively. Here, n, the number of disks is finite value.

Scale-UP

It is the execution of larger task using parallel implementation of more resources in the same amount of time which was consumed by smaller task (increasing the degree of parallelism). Scaleup means enhanced throughput.

Shared Disk Architecture

It is one of the parallel architecture in parallel database management system. Many processors share a common disk/disk setup. Here, processors have their own memory. That is, every processor has its private memory.

Shared Memory Architecture

Shared Memory Architecture is one of parallel architecture in parallel database management system. Many processors share a common memory (RAM) and disk setup in this architecture, which really helps in inter processor communication.

Shared Nothing Architecture

This is yet another parallel architecture in parallel database systems. Every processor has its own memory and disk which are connected through high speed network. This is the architecture widely used in the case of Distributed Database Systems.

Skew

Skew is the effect causes the performance degradation of parallel system due to the uneven distribution of work load.

Speed-Up

It is the execution of a given task faster than before using parallel implementation of more resources (increasing the degree of parallelism). Speedup means improved response time.

Start-up Cost

In parallel database systems, the start-up cost is the cost associated with the task (queries/transactions) initiation. That is, if a task is to be done in parallel, then it has to be broken into pieces (equal) where every piece of task can be done on different processors.

Sub-linear Speed-up

A parallel system cannot show linear speedup always. It is due to the reasons like Skew, Interference, Startup and Assembling costs involved in parallelizing the process. Hence, the speedup is going to be little lesser than N (see Linear Speed-up). Such a speedup is called Sub-linear Speed-up.

Throughput

The number of jobs (jobs can be queries/transactions) that can be completed in a given amount of time by a machine (a parallel machine in the case of parallel database). This is one of the two performance measures of a database management system (the other one is Response Time).

TOPICS (Click to Navigate)

Wednesday, February 4, 2015

Keywords in Distributed Database / Keywords defined in Distributed Database / Two Marks Questions with Answers in Distributed Database

Keywords and Definitions

Thursday, August 21, 2014

Useful keywords and definitions in DBMS / Two Mark definitions in DBMS / Need to know definitions in DBMS

A

Attribute Domain

B

C

Conceptual Schema

D

E

Entity Set

F

Foreign Key

G

H

I

J

K

L

Logical Schema

Lost Update problem (concurrency control)

M

Many-to-One Relationship

N

O

P

Participation

Physical Schema

Prime Attribute

Q

R

S

Selectivity

Simple Attribute

T

U

V

W

X

Y

Z