List of Underlying Concepts

The following is the current version of the underlying concepts for each of the Conformed Dimensions of Data Quality (r4.3). The definition of each of the dimensions is available here.

Click on either the Dimension, (e.g. Completeness) or the Underlying Concept (e.g. Record Population) in order to see a list of blogs covering those topics. Alternatively use the site search box (top right) to search for key words you're interested in finding (including any of the dimensions or underlying concepts).

Conformed Dimension

Underlying Concepts

Definition of Underlying Concept

Completeness

Record Population

This measures whether a row is present in a data set (table).

Attribute Population

This measures whether a value is present (not null) for an attribute (column).

Truncation

This measures whether the value contains all characters of the correct value.

Existence

Existence identifies whether a real-life fact has been captured as data.

Accuracy

Agree with Real-worldDegree that data factually represents its associated real-world object, event, or concept.

Match to Agreed Source

Measure of agreement between data and the source of that data. This is used when the data represent intangible objects or transactions that can't be observed visually.

Consistency

Equivalence of Redundant or Distributed DataThe measure of similarity with other sources of data that represent the same concept.
Format ConsistencyThis measures the conformity of format of the same data in different places.
Logical ConsistencyLogical consistency measures whether two attributes of related data are conceptually in agreement, even though they may not record the same characteristic of a fact.

Temporal Consistency

The measure of uniformity of the data compared to historical values.

Validity

Values in Specified RangeValues must be between some lower number and some higher number.
Values Conform to Business RuleValidity measures whether values adhere to some declarative formula.

Domain of Predefined Values

This is a set of permitted values.
Values Conform to Data TypeValidity measures whether values have a specific characteristic (e.g. Integer, Character, Boolean). Data types restrict what values can exist, the operations that can be use on it, and the way that the data is stored.
Values Conform to FormatValidity measures whether the data are arranged or composed in a predefined way.

Timeliness

Time Expectation for Availability

The measure of time between when data is expected versus made available.

Manual Float

Manual float is a measure of the time from when an observation is made to the point it is recorded in electronic format.

Electronic Float

Electronic float is a measure of the time from when data is captured in an electronic format until it is accessed by a person.

Currency

Current with World it Models

Data is current if it reflects the present state of the concept it models.

Accessibility

Ease of Obtaining DataThis measures how easy it is to obtain data.
Access ControlAccess control includes the identification of a person that wants to access data, authentication of their identity, review and authorization to access required data, and lastly auditing the access of that data.
RetentionRetention refers to the period of time that data is kept before being removed from a database through purge or archive processing.

Integrity

Referential Integrity

Referential integrity measures whether if when a value (foreign key) is used it must reference an existing key (primary key) in the parent table.

Uniqueness

Uniqueness measures whether each fact is uniquely represented.

Cardinality

Cardinality describes the relationship between one data set and another, such as one-to-one, one-to-many, or many-to-many.

Precision

Precision of Data Value

The measure of preciseness of numeric data using decimal places, rounding and truncation.

Granularity

The detail or summary of data defines the granularity measured by the number of attributes used to represent a single concept.

Domain Precision

Domain Precision is the granularity for which a concept is represented as an attribute.

Lineage

Source Documentation

Source documentation provides data provenance which describes the origin of the data.

Segment Documentation

Segment documentation provides how data is transformed and transported from one location to another.

Target Documentation

Documentation about the target explains where the data moved to and how it is stored.

End-to-End Graphical Documentation

End-to-End documentation provides diagrammatic visual representation of how the data flows from beginning to end.

Representation

Easy to Read and Interpret

Illustrations and charts should be self-explanatory and presented with appropriate labels, providing context.

Presentation Language

Data that is represented well is simple but elegantly formed with good grammar and presented in a standard way.

Media Appropriate

The appropriate media (e.g. Web-based, hardcopy, or audio…etc) are provided.

Metadata Availability

Comprehensive descriptions and other information about the characteristics of the data are provided in plain language.

Includes Measurement Units

Well represented data includes the scale of measurement, such as weight, height, distance…etc.

 

image
This work is licensed under a Creative Commons Attribution 4.0 International License.