In GIS uncertainty and quality issues are much more broad In addition to positional accuracy there is: attribute accuracy, completeness of the data, sources and
Previous PDF | Next PDF |
[PDF] Attribute Data Input and Management
Attribute data describe the characteristics of the map AGS 722 Linking Attribute Data and Spatial Data ▫Defined by data types allowed in GIS package
[PDF] Chapter 8 ATTRIBUTE DATA INPUT AND - Fresno State
8 1 Attribute Data in GIS 8 1 1 Type of Attribute Table 8 1 2 Database Management 8 1 3 Type of Attribute Data Box 8 1 Categorical and Numeric Data 8 2 The
[PDF] Présentation PowerPoint - University of New Brunswick
Visualization Methods and GIS Functionality Data and Querying • The database attribute structure of these datasets allows for a user to query spatial
[PDF] Chapter 4 Spatial data & spatial DB sys 1 Introduction spatial DB
1) spatial data : data of spatial attributes that denote a location / near the surface OGC geometry object model (Open GIS Simple Feature Specification for SQL) spatially extended SQL consists of 2 parts : query language + presentation
[PDF] ECUS Presentation - Esri
18 nov 2015 · Model Real-Time Enterprise Data Imagery Web Maps Web Scenes GIS Maps Data Portal Attributes Construction Multifeature
[PDF] Chapter 7: Methods for GIS Data Manipulation, Analysis, and
importance For organizational and presentation purposes, this chapter describes the methods of each For this driver, the team analyzed GIS data that Selected all wetland areas from the attribute table and assigned them a score of 10 All
[PDF] GIS DATA MODELS
types of data in GIS; spatial data and non-spatial data (Attribute data) Non- spatial Data Model In order to represent the spatial information and their attributes, a data model unrealistic presentation unless the resolution is sufficiently high
[PDF] Principles of GIS
(Satellite images, aerial photos, etc ) Representation of Geographic Data Page 17 PRESENTATION GRAPHICS ○ Thematic mapping is a
[PDF] Chapter 1
In GIS uncertainty and quality issues are much more broad In addition to positional accuracy there is: attribute accuracy, completeness of the data, sources and
[PDF] attribute data management in gis ppt
[PDF] attribute in statistics
[PDF] attribute measures
[PDF] attribute of database
[PDF] attribute types
[PDF] attributes and methods in java
[PDF] attributes dataset h5py
[PDF] attributes dataset js
[PDF] attributes dataset python
[PDF] attributes of data mining
[PDF] attributes of data warehouse
[PDF] attributes of dataframe
[PDF] attributes of dataframe in python
[PDF] attributes of dataframe pandas
1
Data Quality
andError Analysis in GIS
Joshua Greenfeld, PhD, LS Professor emeritus, NJIT Professor, Israel Institute of Technology Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 1ABSTRACT
One of the major challenges of GIS is dealing with the uncertainty and the assessment of the quality of spatial information.The challenge is to assess the quality of spatial
information not just the quality of spatial data.Many professionals are involved in providing GIS
services. Surveying is only one of them. For surveying to make a mark on the GIS industry and become a prominent stake holder of GIS, it has to offer some expertise that most other professionals cannot. Unfortunately, the ability to collect spatial data is becoming a common skill and the surveyors positioning expertise is not as unique as it used to be. Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 2ABSTRACT
There is one area that surveyors have an advantage over other GIS professionals is their propensity and ability to understand and quantify spatial errors and accuracies. In surveying, the uncertainty and quality assessment is mostly confined to positioning or positional accuracies. The quality of surveying results is typically assessed on the basis of measurement accuracy and the propagation of these accuracies into other computed quantities. In GIS uncertainty and quality issues are much more broad. In addition to positional accuracy there is: attribute accuracy, completeness of the data, sources and lineage of the data, logical consistency, fuzziness of the spatial phenomenon, currency of the data and other uncertainty issues. Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 3Objective
The objective of this seminar is to enable surveyors to understand the broader issues of accuracy assessment beyond positional accuracies. It will outline the extended definition of uncertainty and quality as it applies to GIS. It will include an overview on the errors and uncertainties that could impact the quality of spatial data. This will be followed by discussing the impact of errors in spatial data on spatial information. The ISO geospatial standards will be reviewed as well. Finally, some practical tools and examples of numerical and statistical assessment of uncertainty and quality of spatial information will be discussed and demonstrated. Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 4 2Importance of Quality
Gain confidence in geodata
Minimize consecutive costs caused by decisions
or actions based on erroneous data Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 5No unified definition of data quality
1. Data Quality refers to the degree of excellence
exhibited by the data in relation to the portrayal of the actual phenomena. GIS Glossary2. The state of completeness, validity, consistency,
timeliness and accuracy that makes data appropriate for a specific use. Government of British Columbia3. The totality of features and characteristics of data
that bears on their ability to satisfy a given purpose; the sum of the degrees of excellence for factors related to data. Glossary of Quality Assurance Terms Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 6No unified definition of data quality
4. Information Quality : the fitness for use of
information; information that meets the requirements of its authors, users, and administrators. (Martin Eppler)5. Data quality: The processes and technologies
involved in ensuring the conformance of data values to business requirements and acceptance criteria6.ISO/PAS 26183:2006 defines product data quality as
a measure of the accuracy and appropriateness of product data, combined with the timeliness with which those data are provided to all the people who need them. Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 7Error and Uncertainty in GIS
One of the major problems currently existing within GIS is the aura of accuracy surrounding digital geographic data Often hardcopy map sources include a map reliability rating or confidence rating in the map legend This rating helps the user in determining the fitness for use for the map However, rarely is this information encoded in the digital conversion process Often because GIS data is in digital form and can be represented with a high precision it is considered to be totally accurate Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 8 3Error and Uncertainty in GIS
In reality, a buffer exists around each feature which represents the actual positional location of the featureFor example, data captured at the 1:20,000 scale
commonly has a positional accuracy of ± 20 metres This means the actual location of features may vary 20 metres in either direction from the identified position of the feature on the map Considering that the use of GIS commonly involves the integration of several data sets, usually at different scales and quality, one can easily see how errors can be propagated during processing Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 9Error and Uncertainty in GIS
The ease with which geographic data in a GIS can be used at any scale highlights the importance of detailed data quality information.Although a data set may not have a specific scale
once it is loaded into the GIS database, it was produced with levels of accuracy and resolution that make it appropriate for use only at certain scales, and in combination with data of similar scales. Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 10Error and Uncertainty in GIS
Error - Two sources of error:
Inherent and Operational
Inherent error is the error present in source
documents and data Operational error is the amount of error produced through the data capture and manipulation functions of a GIS Both contribute to the reduction in quality of the products that are generated by GIS. Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 11Error and Uncertainty in GIS
Possible sources of operational errors include :Mislabelling of areas on thematic maps
Misplacement of horizontal (positional)
boundariesHuman error in digitizing classification error
GIS algorithm inaccuracies
human bias While error will always exist in any scientific process, the aim within GIS processing should be to identify existing error in data sources and minimize the amount of error added during processing Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 12 4Errors in Database Creation
Errors are introduced at almost every step of database creation Concerns the degree to which the data exhausts the universe of possible itemsAre all possible objects included within the
database? Affected by rules of selection, generalization and scale Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 13Error and Uncertainty in GIS
Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 14 Error induced by data cleaning, Longley et al., chapter 6, pages132-133
Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 15Merging. Longley et al., chapter 6, pages 132-133
Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 16 5 Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 17 classification error -- difference in pixel class between the map and a reference 19391956
1971
1995
Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 19
Error and Uncertainty in GIS
Because of cost constraints it is often more appropriate to manage error than attempt to eliminate it! There is a trade-off between reducing the level of error in a data base and the cost to create and maintain the database An awareness of the error status of different data sets will allow user to make a subjective statement on the quality and reliability of a product derived from GIS processing The validity of any decisions based on a GIS product is directly related to the quality and reliability rating of the product Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 20 6Error and Uncertainty in GIS
Depending upon the level of error inherent in the source data, and the error operationally produced through data capture and manipulation, GIS products may possess significant amounts of error Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 21Error and Uncertainty in GIS
Tools to get a handle on uncertainty
Models of uncertainty: methods for assessing and
describing errorError propagation (during analysis)
Fuzzy approaches (membership of classes)
Sensitivity analysis (effect of errors)
Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 22Error and Uncertainty in GIS
Error assessment, reporting, interpretation - more difficultQuality of data: standards and metadata
But: No professional GIS currently in use can present the user with information about the confidence limits that should be associated with the results of an analysis. Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 23Classification of Errors in GIS
Resulting in
Forms of Error
Source of Error Data Collection and Compilation Data Processing Data UsagePositional Error Logical Error
Attribute Error Completeness
(Primary) (Secondary)Final Product Errors
Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 247
Uncertainty
Uncertainties in geographic information originate from different sources: Uncertainty due to the inherent nature of geography: different interpretations can be equally valid; Cartographic uncertainty resulting in positional and attribute errors; Conceptual uncertainty as a result of differences in being mapped Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 25Uncertainty (Definition of a Forest)
0 2 4 6 8 10 12 14 160102030405060708090
Tree Height (m)
Canopy Coverage (%)
Portugal
Mexico
U.S. Israel
Belgium Malaysia UN
Turkey
Estonia
Switzerland
Somalia New Zealand UNESCO Australia Japan
Denmark
Morocco
KenyaZimbabwe
SudanTanzania
Ethiopia
South Africa
Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 26Internal and External Data Quality
internal quality - Corresponds to the level of similarity that external quality - Corresponds to the similarity between the data produced and user needsData that should have been produced
Data produced
User needs 1
User needs 2
User needs n
Internal
Quality
External
Quality 2
Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 27Characteristics to define the
internal qualityCompleteness: presence and absence of features,
their attributes and relationships. Logical consistency: degree of adherence to logical rules of data structure, attribution, and relationships (data structure can be conceptual, logical or physical).Positional accuracy: accuracy of the position of
features.Temporal accuracy: accuracy of the temporal
attributes and temporal relationships of features. Thematic accuracy: accuracy of quantitative attributes and the correctness of non-quantitative attributes and of the classifications of features and their relationships. Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 288 six characteristics to define the external quality (Beard and Vallière) Definition: to evaluate whether the exact nature of a corresponds to user needs (semantic, spatial and temporal definitions). Coverage: to evaluate whether the territory and the
Lineage: to find out where data come from, their
acquisition objectives, the methods used to obtain them, data meet user needs. Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 29six characteristics to define the external quality (Beard and Vallière) Precision: to evaluate what data is worth and whether it is acceptable for an expressed need (semantic, temporal, and spatial precision of the object and its attributes). Legitimacy: to evaluate the official recognition and the legal scope of data and whether they meet the needs of de facto standards, respect recognized standards, have legal or administrative recognition by an official body, or legal guarantee by a supplier, etc.; Accessibility: to evaluate the ease with which the user can obtain the data analyzed (cost, time frame, format, confidentiality, respect of recognized standards, copyright, etc.). Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 30
Conceptual model of
uncertainty in spatial dataUncertainty
Poorly Defined Objects
Well Defined Objects
ErrorVagueness
Probability
Fuzzy Set Theory
Ambiguity
Discord
Non-Specifity
Expert Opinion Dempster Schafer
Endorsement Theory, Fuzzy Set Theory
Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 31Definitions of geographic objects
An examples of well-defined geographical objects is land ownership. The boundary between land parcels is commonly marked on the ground, and shows an abrupt and total change in ownership Examples of poorly defined geographical objects are the rule in natural resource mapping. The conceptualization of mappable phenomena and the spaces they occupy is rarely clear-cut There are rarely sharp transitions from one vegetation type to another In a region there could be several types of vegetation Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 329
Five dimensions of objects A and B
Relation
Scale Space TimeAttribute
B A Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 33Error Ideally, if an object is conceptualized as being definable in both attribute and spatial dimensions, then it has a Boolean occurrence; any location is either part of the object, or it is not. Within GIS, for a number of reasons, a location or the assignment of an object to a location or to the a class may be expressed as a probability. Data Quality and Error Analysis in GIS (c) Dr. J. Greenfeld 34