42
Chapter 4 Chapter 4 The Database Management The Database Management System Concept System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation by: Amita Goyal Chin, Ph.D. Virginia Commonwealth University John Wiley & Sons, Inc.

Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

Embed Size (px)

Citation preview

Page 1: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

Chapter 4Chapter 4The Database Management The Database Management

System ConceptSystem Concept

Fundamentals of Database Management Systemsby

Mark L. Gillenson, Ph.D.

University of Memphis

Presentation by: Amita Goyal Chin, Ph.D.

Virginia Commonwealth University

John Wiley & Sons, Inc.

Page 2: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-22

Chapter ObjectivesChapter Objectives

Discuss the problems encountered in a Discuss the problems encountered in a nondatabase information systems nondatabase information systems environment.environment.

List the five basic principles of the List the five basic principles of the database concept. database concept.

Describe how data can be considered to Describe how data can be considered to be a manageable resource. be a manageable resource.

Page 3: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-33

Chapter ObjectivesChapter Objectives

List the three problems created by data List the three problems created by data redundancy. redundancy.

Describe the nature of data redundancy Describe the nature of data redundancy among many files. among many files.

Explain the relationship between data Explain the relationship between data integration and data redundancy in one integration and data redundancy in one file.file.

Page 4: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-44

Chapter ObjectivesChapter Objectives

State the primary defining feature of a State the primary defining feature of a database management system. database management system.

Explain why the ability to store multiple Explain why the ability to store multiple relationships is an important feature of the relationships is an important feature of the database approach.database approach.

Page 5: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-55

Chapter ObjectivesChapter Objectives

Explain why providing support for such Explain why providing support for such control issues as data security, backup control issues as data security, backup and recovery, and concurrency is an and recovery, and concurrency is an important feature of the database important feature of the database approach. approach.

Explain why providing support for data Explain why providing support for data independence is an important feature of independence is an important feature of the database approach. the database approach.

Page 6: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-66

Data Processing SystemsData Processing Systems

Data was stored in different formats in different Data was stored in different formats in different files.files.

Data was often not shared among different Data was often not shared among different programs that needed it, necessitating the programs that needed it, necessitating the duplication of data in redundant files.duplication of data in redundant files.

Little was understood about file design, resulting Little was understood about file design, resulting in redundant data within individual files.in redundant data within individual files.

Page 7: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-77

Data Processing SystemsData Processing Systems

Files often could not be rebuilt after being Files often could not be rebuilt after being damaged by a software error or a damaged by a software error or a hardware failure.hardware failure.

Data was not secure and was vulnerable Data was not secure and was vulnerable to theft or malicious mischief by people to theft or malicious mischief by people inside or outside of the company.inside or outside of the company.

Page 8: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-88

Data Processing SystemsData Processing Systems

Programs were usually written in such a Programs were usually written in such a manner that if the way that the data was manner that if the way that the data was stored changed, the program had to be stored changed, the program had to be modified to continue working.modified to continue working.

Changes in everything from access Changes in everything from access methods to tax tables required methods to tax tables required programming changes.programming changes.

Page 9: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-99

The Database ConceptThe Database Concept

1. Data as a Manageable Resource1. Data as a Manageable Resource

2. Data Integration and Data Redundancy2. Data Integration and Data Redundancy

3. Multiple Relationships3. Multiple Relationships

4. Data Control Issues4. Data Control Issues

5. Data Independence5. Data Independence

Page 10: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-1010

The Database ConceptThe Database Concept

Data as a Manageable Resource - The Data as a Manageable Resource - The creation of a data-centric environment in creation of a data-centric environment in which a company’s data can truly be which a company’s data can truly be thought of as a significant corporate thought of as a significant corporate resource. A key feature of this resource. A key feature of this environment is the ability to share data environment is the ability to share data among those inside and outside of the among those inside and outside of the company who require access to it. company who require access to it.

Page 11: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-1111

The Database ConceptThe Database Concept

Data Integration and Data Redundancy - The Data Integration and Data Redundancy - The ability to achieve data integration while at the ability to achieve data integration while at the same time storing data in a nonredundant same time storing data in a nonredundant fashion. This, alone, is the central, defining fashion. This, alone, is the central, defining feature of the database approach.feature of the database approach.

Multiple RelationshipsMultiple Relationships - The ability to store data - The ability to store data representing entities involved in multiple representing entities involved in multiple relationships without introducing data relationships without introducing data redundancy or other structural problems. redundancy or other structural problems.

Page 12: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-1212

The Database ConceptThe Database Concept

Data Control Issues - Data Control Issues - The establishment of The establishment of an environment that manages certain data an environment that manages certain data control issues, such as data security, control issues, such as data security, backup and recovery, and concurrency backup and recovery, and concurrency control.control.

Data IndependenceData Independence - The establishment of - The establishment of an environment that permits a high degree an environment that permits a high degree of data independence.of data independence.

Page 13: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-1313

Data as a Manageable Data as a Manageable ResourceResource

Information systems environment:Information systems environment: HardwareHardware NetworksNetworks Applications softwareApplications software Systems softwareSystems software PeoplePeople DataData

Page 14: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-1414

Data as a Manageable Data as a Manageable ResourceResource

Historically, data has not been the highest Historically, data has not been the highest priority concern.priority concern.

As the operational, tactical, and strategic As the operational, tactical, and strategic corporate levels became more dependent corporate levels became more dependent on information systems, data increasingly on information systems, data increasingly became recognized as an important became recognized as an important corporate resource.corporate resource.

Page 15: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-1515

Data as a Manageable Data as a Manageable ResourceResource

The corporate community became The corporate community became increasingly convinced that a firm’s data increasingly convinced that a firm’s data could provide a significant competitive could provide a significant competitive advantage to the firm.advantage to the firm.

It became clear that data would have to be It became clear that data would have to be managed in an organized way.managed in an organized way.

Page 16: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-1616

Data as a Manageable Data as a Manageable ResourceResource

Page 17: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-1717

Data as a Manageable Data as a Manageable ResourceResource

Needed a Needed a software utilitysoftware utility that could that could manage and protect data.manage and protect data.

Data could be a critical Data could be a critical corporate corporate resourceresource..

Out of this need was born the database Out of this need was born the database management system.management system.

Page 18: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-1818

Data as a Manageable Data as a Manageable ResourceResource

Reengineering - data is aggressively used Reengineering - data is aggressively used to redesign business processes.to redesign business processes.

Electronic commerce - a database at the Electronic commerce - a database at the heart of every web site; allows companies heart of every web site; allows companies and customers to conduct business.and customers to conduct business.

ERP systems - collections of application ERP systems - collections of application programs build around a central database.programs build around a central database.

Page 19: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-1919

Data Integration and Data Data Integration and Data RedundancyRedundancy

Data integration - the ability to tie together Data integration - the ability to tie together pieces of related data within an pieces of related data within an information system.information system.

Data redundancy - the same fact about the Data redundancy - the same fact about the business environment is stored more than business environment is stored more than once within an information system.once within an information system.

Page 20: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-2020

Data Redundancy - ProblemsData Redundancy - Problems

Redundant data takes up a great deal of Redundant data takes up a great deal of extra disk space. extra disk space.

If the redundant data has to be updated, it If the redundant data has to be updated, it takes additional time to do so. This can be takes additional time to do so. This can be a major performance issue.a major performance issue.

There is the potential for data integrity There is the potential for data integrity problems.problems.

Page 21: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-2121

Data IntegrityData Integrity

Refers to the accuracy of the data.Refers to the accuracy of the data.

Inaccurate data leaves the whole Inaccurate data leaves the whole information system of limited value.information system of limited value.

Page 22: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-2222

Data Redundancy,Data Redundancy,Data IntegrityData Integrity

When all copies of redundant data are not When all copies of redundant data are not updated consistently, a data integrity problem updated consistently, a data integrity problem exists.exists.

Page 23: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-2323

Three Files with Redundant Three Files with Redundant DataData

Page 24: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-2424

Three Files with a Data Three Files with a Data Integrity ProblemIntegrity Problem

Page 25: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-2525

General General Hardware Hardware Company Company FilesFiles

Page 26: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-2626

General Hardware Company General Hardware Company Combined FileCombined File

0121 Main St.Hardware

137 New York 137 Baker 10 1995

0839 Jane’s Stores 186 Chicago 186 Adams 15 20010933 ABC Home

Stores137 Los Angeles 137 Baker 10 1995

1047 AcmeHardwareStore

137 Los Angeles 137 Baker 10 1995

1525 Fred’s ToolStores

361 Atlanta 361 Carlyle 20 2001

1700 XYZ Stores 361 Washington 361 Carlyle 20 20011826 City Hardware 137 New York 137 Baker 10 19952198 Western

Hardware204 New York 204 Dickens 10 1998

2267 Central Stores 186 New York 186 Adams 15 2001

Page 27: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-2727

AnomaliesAnomalies

Typically occur in poorly structured files.Typically occur in poorly structured files.

Problems arise when two different kinds of Problems arise when two different kinds of data, like salesperson and customer data data, like salesperson and customer data are merged into one file.are merged into one file.

Page 28: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-2828

AnomaliesAnomalies

Deletion Anomaly - e.g, if you delete a customer and that Deletion Anomaly - e.g, if you delete a customer and that record was the only one for a salesperson, the record was the only one for a salesperson, the salesperson’s data is gone.salesperson’s data is gone.

Insertion Anomaly - e.g., General Hardware cannot add Insertion Anomaly - e.g., General Hardware cannot add data about a new salesperson the company just hired data about a new salesperson the company just hired until she is assigned at least one customer.until she is assigned at least one customer.

Update Anomaly - redundant data in the database file Update Anomaly - redundant data in the database file must be updated each place it exists when it changesmust be updated each place it exists when it changes

Page 29: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-2929

Database Management Database Management SystemSystem

A software utility for storing and retrieving A software utility for storing and retrieving data that gives the end-user the data that gives the end-user the impression that the data is well integrated impression that the data is well integrated even though the data can be stored with even though the data can be stored with no redundancy at all.no redundancy at all.

Page 30: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-3030

Multiple Relationships - Multiple Relationships - Horizontal SolutionHorizontal Solution

SalespersonNumber

SalespersonName

CommissionPercentage

YearOfHire

CustomerNumbers

137 Baker 10 1995 0121, 0933, 1047, 1826186 Adams 15 2001 0839, 2267204 Dickens 10 1998 2198361 Carlyle 20 2001 1525, 1700

(a) Salesperson file.

CustomerNumber

CustomerName HQ City

0121 Main St. Hardware New York0839 Jane’s Stores Chicago0933 ABC Home Stores Los Angeles1047 Acme Hardware Store Los Angeles1525 Fred’s Tool Stores Atlanta1700 XYZ Stores Washington1826 City Hardware New York2198 Western Hardware New York2267 Central Stores New York

(b) Customer file.

Page 31: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-3131

Multiple Relationships - Multiple Relationships - Vertical SolutionVertical Solution

Page 32: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-3232

Data Control IssuesData Control Issues

Data securityData security

Backup and RecoveryBackup and Recovery

Concurrency ControlConcurrency Control

Page 33: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-3333

Computer SecurityComputer Security

A very broad topicA very broad topic Protecting the physical hardware Protecting the physical hardware

environmentenvironment Defending against hacker attacksDefending against hacker attacks Encrypting data transmitted over networksEncrypting data transmitted over networks Educating employees on the importance of Educating employees on the importance of

protecting the company’s dataprotecting the company’s data … … and many moreand many more

Page 34: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-3434

Backup and RecoveryBackup and Recovery

Data can be lost or corrupted in any of a Data can be lost or corrupted in any of a variety of ways:variety of ways: a disaster such as a fire, a hurricane, or an a disaster such as a fire, a hurricane, or an

earthquakeearthquake hackershackers computer virusescomputer viruses poorly written application programspoorly written application programs unintentional errorunintentional error

Page 35: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-3535

Concurrency ProblemConcurrency Problem

Updates to a database can interfere with Updates to a database can interfere with each other in such a way that the resulting each other in such a way that the resulting data values will be incorrect.data values will be incorrect.

A database management system must be A database management system must be designed to protect its databases from designed to protect its databases from such an eventuality.such an eventuality.

Page 36: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-3636

Data IndependenceData Independence

Data Dependence - if for any reason the storage Data Dependence - if for any reason the storage characteristics of the data had to be changed, characteristics of the data had to be changed, the application program itself had to be modified, the application program itself had to be modified, often extensively.often extensively.

Data Independence - to have a data storage and Data Independence - to have a data storage and programming environment in which as many programming environment in which as many types of changes in the data structure as types of changes in the data structure as possible would not require changes in the possible would not require changes in the application programs that use them.application programs that use them.

Page 37: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-3737

Major DBMS ApproachesMajor DBMS Approaches

Hierarchical - navigationalHierarchical - navigational

Network - navigationalNetwork - navigational

Relational - became commercially viable in Relational - became commercially viable in about 1980.about 1980.

Object-oriented - useful for a variety of niche Object-oriented - useful for a variety of niche applications.applications.

Page 38: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-3838

Hierarchical & NetworkHierarchical & Network

Called navigational approaches because Called navigational approaches because of the way that programs have to of the way that programs have to “navigate” through hierarchies and “navigate” through hierarchies and networks of data to find needed data.networks of data to find needed data.

Developed in the 1960s andDeveloped in the 1960s and

Somewhat similar in structureSomewhat similar in structure

Page 39: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-3939

Hierarchical & NetworkHierarchical & Network

Suitable only for mainframe computersSuitable only for mainframe computers

were an elegant solution to the were an elegant solution to the redundancy/integration problem at the timeredundancy/integration problem at the time

Complex, difficult to workComplex, difficult to work

Now considered legacy systemsNow considered legacy systems

Page 40: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-4040

Relational DatabaseRelational Database

Became commercially viable in about Became commercially viable in about 19801980

Soon became the preferred DBMS Soon became the preferred DBMS approach and it has remained so ever approach and it has remained so ever since.since.

Page 41: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-4141

Object-orientedObject-oriented

Has proven useful for a variety of niche Has proven useful for a variety of niche applications applications

It is interesting to note that some of the It is interesting to note that some of the key object-oriented database concepts key object-oriented database concepts have found their way into some of the have found their way into some of the mainstream relational DBMSs and some mainstream relational DBMSs and some are described as taking a hybrid are described as taking a hybrid object/relational approach to database.object/relational approach to database.

Page 42: Chapter 4 The Database Management System Concept Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation

4-4-4242

“Copyright 2004 John Wiley & Sons, Inc. All rights reserved. Reproduction or translation of this work beyond that permitted in Section 117 of the 1976 United States Copyright Act without express permission of the copyright owner is unlawful. Request for further information should be addressed to the Permissions Department, John Wiley & Sons, Inc. The purchaser may make back-up copies for his/her own use only and not for distribution or resale. The Publisher assumes no responsibility for errors, omissions, or damages caused by the use of these programs or from the use of the information contained herein.”