Data refers to a collection of facts, statistics, or information that is organized and stored for analysis, interpretation, and use in various contexts. Data refers to a compilation of discrete and individual units of information. It can be utilized in various formats such as text, numerical data, multimedia, bytes, and so on. It can be stored in physical mediums such as paper or digital storage devices.
The term 'Data' is derived from the word 'datum', which refers to a singular item of information. The term "data" is the plural form of the word "datum." In the field of computers, data refers to information that may be converted into a format that allows for efficient transfer and manipulation.
A database is a structured collection of data that is organized and stored in a computer system. It allows for efficient storage, retrieval, and manipulation of large amounts of information.
A database is a cohesive collection of interconnected data that is utilized for effective retrieval, insertion, and deletion. It is also employed to arrange the data in a structured manner, such as in a table, schema, views, and reports, among other formats.
For instance, the College Database efficiently categorizes information pertaining to the administration, staff, students, and teachers, among other entities. The database allows for effortless retrieval, insertion, and deletion of information.
A database management system (DBMS) is a software application used to efficiently manage and organize databases. For instance, MySQL and Oracle are widely utilized commercial databases in many applications.
DBMS offers an interface to execute a range of tasks, including database creation, data storage, data updating, and table creation within the database.
It offers safeguarding and assurance to the database. When there are several users, it ensures the uniformity of data.
DBMS enables users to perform the following tasks:
- Data Definition: It is utilized to create, modify, and delete the definition that specifies the arrangement of data in the database.
- Data Updation: It refers to the process of inserting, modifying, or deleting actual data in a database.
- Data retrieval is the process of extracting information from a database that can be utilized by programs for various purposes.
- User Administration is a system that is utilized to register and monitor users, uphold the integrity of data, enforce data security, manage concurrency control, monitor performance, and recover information that has been corrupted due to unforeseen failures.
Attributes of a Database Management System (DBMS):
- The information is stored and managed using a digital repository established on a server.
- It offers a coherent and rational perspective on the procedure that alters data.
- DBMS includes automated backup and recovery protocols.
- The system incorporates ACID features to ensure data integrity and consistency in the event of a failure.
- It has the ability to simplify the intricate connection between data.
- It facilitates the manipulation and processing of data.
- Its purpose is to ensure the confidentiality and integrity of data.
- The database can be accessed from several perspectives based on the user's needs.
Benefits of a Database Management System (DBMS)
- Manages database redundancy: It effectively manages data redundancy by consolidating all data into a single database file and storing recorded data within the database.
- Data sharing: In a Database Management System (DBMS), the data can be shared among various users who have been granted authorization by the organization.
- Effortlessly Maintenance: The database system's centralized design facilitates easy maintenance.
- Time reduction: It decreases the amount of time required for development and minimizes the requirement for maintenance.
- Backup: The system includes backup and recovery subsystems that automatically produce backups of data in the event of hardware or software problems, and restore the data when necessary.
- The multiple user interface offers several forms of user interfaces, including graphical user interfaces and application program interfaces.
Drawbacks of a Database Management System (DBMS)
- Running DBMS software necessitates a high-speed data processor and a large memory size.
- It requires a significant amount of disk space and memory to run efficiently.
- Database systems introduce added intricacy and necessitate additional specifications.
- The impact of failure on the database is significant. In many organizations, all data is saved in a single database.
- If the database is damaged due to electrical failure or corruption, the data may be permanently lost.
Database managers establish a database in a manner that restricts data access to all users through a single software program.
The primary objective of the database is to efficiently handle a substantial volume of information through the processes of storing, retrieving, and managing data.
There is a wide range of databases accessible, such as MySQL, Sybase, Oracle, MongoDB, Informix, PostgreSQL, SQL Server, and more.
SQL, also known as Structured Query Language, is a programming language used to manipulate and interact with data contained in a database. SQL relies on the principles of relational algebra and tuple relational calculus.
The database has undergone almost 50 years of evolution, transitioning from a flat-file system to relational and object-relational systems. It has undergone multiple iterations.
Historical Background
Based on Files
The year 1968 marked the introduction of File-Based databases. In file-based databases, data was stored in a single, unstructured file. While files offer numerous benefits, they also have several constraints.
An important benefit of the file system is its diverse range of access techniques, such as sequential, indexed, and random.
Extensive programming in a third-generation language like COBOL or BASIC is necessary.
The hierarchical data model is a method of organizing data in a hierarchical structure.
The Hierarchical Database dominated the period from 1968 to 1980.
The IBM's initial DBMS was based on the prominent hierarchical database paradigm. The system was referred to as IMS, which stands for Information Management System.
Files in this approach are organized hierarchically, with each file having a parent and potentially multiple children.
Network Data Model
The network data model is a conceptual model used to organize and represent data in a network structure.
Charles Bachman pioneered the development of the initial Database Management System (DBMS) known as Integrated Data Store (IDS) while working at Honeywell. The development of this technology took place in the early 1960s, and it was later standardized in 1971 by the CODASYL group (Conference on Data Systems Languages).
In this paradigm, files are associated with owners and members, similar to the conventional network model.
However, this approach was constrained by restrictions such as intricate system complexity and challenges in its design and maintenance.
Relational Database
A relational database is a structured collection of data organized in tables, where the data is stored in a way that allows for efficient retrieval and manipulation.
1970 - Present: It is the era of Relational Database and Database Management. In 1970, the relational model was proposed by E.F. Codd.
The relational database model consists of two primary concepts known as instance and schema. The instance is a table with rows or columns. Schema specifies the structure like name of the relation, type of each column and name.
This paradigm employs mathematical concepts such as set theory and predicate logic. The first internet database application had been created in 1995. During the era of the relational database, many more models had introduced like object-oriented model, object-relational model, etc.
Cloud database
Cloud database facilitates you to store, manage, and retrieve their structured, unstructured data via a cloud platform. This data is accessible over the Internet. Cloud databases are commonly referred to as Database as a Service (DBaaS) due to their provision as a managed service.
Benefits of utilizing a cloud-based database
Reduce expenses.
Typically, a corporate provider is not required to make investments in databases. It can maintain and support one or more data centers.
Automated
Cloud databases are enriched with a variety of automated processes such as recovery, failover, and auto-scaling.
Increased accessibility
You can access your cloud-based database from any location, anytime. All you need is just an internet connection.
NoSQL Database
A NoSQL database is an approach to design such databases that can accommodate a wide variety of data models. NoSQL stands for "not only SQL." It is an alternative to traditional relational databases in which data is placed in tables, and data schema is perfectly designed before the database is built.
NoSQL databases are useful for a large set of distributed data. Some examples of NoSQL database system with their category are:
MongoDB, CouchDB, Cloudant (Document-based)
Memcached, Redis, Coherence (key-value store)
HBase, Big Table, Accumulo (Tabular).
Advantage of NoSQL:
- NoSQL can handle an extensive amount of data because of scalability. If the data grows, NoSQL database scale it to handle that data in an efficient manner.
- NoSQL supports auto replication. Auto replication makes it highly available because, in case of any failure, data replicates itself to the previous consistent state.
- NoSQL is an open-source database, so there is no reliable standard for NoSQL yet.
- Data management in NoSQL is much more complicated than relational databases. It is very challenging to install and even more hectic to manage daily.
- GUI tools for NoSQL database are not easily available in the market.
- Backup is a great weak point for NoSQL databases. Some databases, like MongoDB, have no powerful approaches for data backup.
The Object-Oriented Databases
The object-oriented databases contain data in the form of object and classes. Objects are the real-world entity, and types are the collection of objects. An object-oriented database is a combination of relational model features with objects oriented principles.
It is an alternative implementation to that of the relational model.
Object-oriented databases hold the rules of object-oriented programming. An object-oriented database management system is a hybrid application.
The object-oriented database model contains the following properties.
Object-oriented programming properties:
Objects,
Classes,
Inheritance,
Polymorphism,
Encapsulation,
Relational database properties:
Atomicity,
Consistency,
Integrity,
Durability,
Concurrency,
Query processing
Graph Databases
A graph database is a NoSQL database. It is a graphical representation of data.
It contains nodes and edges. A node represents an entity, and each edge represents a relationship between two edges. Every node in a graph database represents a unique identifier.
Graph databases are beneficial for searching the relationship between data because they highlight the relationship between relevant data.
Graph databases are very useful when the database contains a complex relationship and dynamic schema.
It is mostly used in supply chain management, identifying the source of IP telephony.
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.