ordinal attributes in data mining


PDF
List Docs
PDF Lecture Notes for Chapter 2 Introduction to Data Mining 2

An attribute is a property or characteristic of an object Examples: eye color of a person temperature etc Attribute is also known as variable field characteristic dimension or feature A collection of attributes describe an object – Object is also known as record point case sample entity or instance

PDF Data Mining Classification: Basic Concepts and Techniques

Classification: Definition Given a collection of records (training set ) – Each record is by characterized by a tuple (xy) where x is the attribute set and y is the class label x: attribute predictor independent variable input y: class response dependent variable output

  • What are attributes in data mining?

    A person’s hair colour, air humidity etc. An attribute set defines an object. The object is also referred to as a record of the instances or entity. In data mining, understanding the different types of attributes or data types is essential as it helps to determine the appropriate data analysis techniques to use.

  • What is ordinal data?

    Ordinal data represents qualitative data that can be ranked in a particular order. For instance, education level can be ranked from primary to tertiary, and social status can be ranked from low to high. In ordinal data, the distance between values is not uniform.

  • What is an ordinal attribute?

    An ordinal attribute is an attribute whose possible values have a meaningful order or ranking among them, but the magnitude between successive values is not known. However, to do so, it is important to convert the states to numbers where each state of an ordinal attribute is assigned a number corresponding to the order of attribute values.

Collection of data objects and their attributes

An attribute is a property or characteristic of an object Examples: eye color of a person, temperature, etc. Attribute is also known as variable, field, characteristic, dimension, or feature A collection of attributes describe an object – Object is also known as record, point, case, sample, entity, or instance www-users.cse.umn.edu

Attribute Values

Attribute values are numbers or symbols assigned to an attribute for a particular object www-users.cse.umn.edu

Distinction between attributes and attribute values

Same attribute can be mapped to different attribute values Example: height can be measured in feet or meters Different attributes can be mapped to the same set of values Example: Attribute values for ID and age are integers But properties of attribute can be different than the properties of the values used to represent the attribute www-users.cse.umn.edu

The type of an attribute depends on which of the following properties/operations it possesses:

Distinctness: =  Order: < > Differences are + -meaningful : Ratios are * / meaningful Nominal attribute: distinctness Ordinal attribute: distinctness & order Interval attribute: distinctness, order & meaningful differences Ratio attribute: all 4 properties/operations www-users.cse.umn.edu

Is it physically meaningful to say that a temperature of 10 ° is twice that of 5° on

the Celsius scale? the Fahrenheit scale? the Kelvin scale? www-users.cse.umn.edu

Consider measuring the height above average

If Bill’s height is three inches above average and Bob’s height is six inches above average, then would we say that Bob is twice as tall as Bill? Is this situation analogous to that of temperature? www-users.cse.umn.edu

This categorization of attributes is due to S. S. Stevens

This categorization of attributes is due to S. S. Stevens www-users.cse.umn.edu

Discrete Attribute

Has only a finite or countably infinite set of values Examples: zip codes, counts, or the set of words in a collection of documents Often represented as integer variables. Note: binary attributes are a special case of discrete attributes www-users.cse.umn.edu

Continuous Attribute

Has real numbers as attribute values Examples: temperature, height, or weight. Practically, real values can only be measured and represented using a finite number of digits. Continuous attributes are typically represented as floating-point variables. www-users.cse.umn.edu

Asymmetric Attributes

Only presence (a non-zero attribute value) is regarded as important Words present in documents Items present in customer transactions If we met a friend in the grocery store would we ever say the following? “I see our purchases are very similar since we didn’t buy most of the same things.” www-users.cse.umn.edu

Critiques of the attribute categorization

Incomplete Asymmetric binary Cyclical Multivariate Partially ordered Partial membership Relationships between the data Real data is approximate and noisy This can complicate recognition of the proper attribute type Treating one attribute type as another may be approximately correct www-users.cse.umn.edu

The types of operations you choose should be “meaningful” for the type of data you have

Distinctness, order, meaningful intervals, and meaningful ratios are only four (among many possible) properties of data The data type you see – often numbers or strings – may not capture all the properties or may suggest properties that are not present Analysis may depend on these other properties of the data Many statistical analyses depend only o

Important Characteristics of Data

Dimensionality (number of attributes) High dimensional data brings a number of challenges Sparsity Only presence counts Resolution Patterns depend on the scale Size Type of analysis may depend on size of data www-users.cse.umn.edu

Ordered

Spatial Data Temporal Data Sequential Data Genetic Sequence Data www-users.cse.umn.edu

Data Matrix

If data objects have the same fixed set of numeric attributes, then the data objects can be thought of as points in a multi-dimensional space, where each dimension represents a distinct attribute Such a data set can be represented by an m by n matrix, where there are m rows, one for each object, and n columns, one for each attribute www-users.cse.umn.edu

Transaction Data

A special type of data, where Each transaction involves a set of items. For example, consider a grocery store. The set of products purchased by a customer during one shopping trip constitute a transaction, while the individual products that were purchased are the items. Can represent transaction data as record data www-users.cse.umn.edu

Ordered Data

Genomic sequence data GGTTCCGCCTTCAGCCCCGCGCC CGCAGGGCCCGCCCCGCGCCGTC GAGAAGGGCCCGCCTGGCGGGCG GGGGGAGGCGGGGCCGCCCGAGC CCAACCGAGTCCGACCAGGTGCC CCCTCTGCTCGGCCTAGACCTGA GCTCATTAGGCGGCAGCGGACAG GCCAAGTAGAACACGCGAAGCGC TGGGCTGCCTGCTGCGACCAGGG www-users.cse.umn.edu

Data Quality

Poor data quality negatively affects many data processing efforts Data mining example: a classification model for detecting people who are loan risks is built using poor data Some credit-worthy candidates are denied loans More loans are given to individuals that default www-users.cse.umn.edu

Data Quality

What kinds of data quality problems? How can we detect problems with the data? What can we do about these problems? www-users.cse.umn.edu

Examples of data quality problems:

Noise and outliers Wrong data Fake data Missing values Duplicate data www-users.cse.umn.edu

Noise

For objects, noise is an extraneous object For attributes, noise refers to modification of original values Examples: distortion of a person’s voice when talking on a poor phone and “snow” on television screen The figures below show two sine waves of the same magnitude and different frequencies, the waves combined, and the two sine waves with random

Outliers

Outliers are data objects with characteristics that are considerably different than most of the other data objects in the data set Case 1: Outliers are noise that interferes with data analysis Case 2: Outliers are the goal of our analysis Credit card fraud Intrusion detection Causes? Missing Values Reasons for missing values Information is not coll

Share on Facebook Share on Whatsapp











Choose PDF
More..











ordinal variable example statistics ordinal variable examples ordre à cours limité achat ordre limité boursorama oregon oregon plan to reopen organic chemistry mcat prep organocopper reagents in organic synthesis

PDFprof.com Search Engine
Images may be subject to copyright Report CopyRight Claim

Is nominal  ordinal  \u0026 binary for quantitative data  qualitative

Is nominal ordinal \u0026 binary for quantitative data qualitative


attributes types in data mining

attributes types in data mining


Is nominal  ordinal  \u0026 binary for quantitative data  qualitative

Is nominal ordinal \u0026 binary for quantitative data qualitative


Nominal VS Ordinal Data: Definition  Examples and Difference

Nominal VS Ordinal Data: Definition Examples and Difference


What is Ordinal Data? Examples  Variables \u0026 Analysis

What is Ordinal Data? Examples Variables \u0026 Analysis


attributes types in data mining

attributes types in data mining


PDF) Mining rank-correlated sets of numerical attributes

PDF) Mining rank-correlated sets of numerical attributes


PDF) Manipulating measurement scales in medical statistical

PDF) Manipulating measurement scales in medical statistical


Lecture Notes for Chapter 2 Introduction to Data Mining - ppt download

Lecture Notes for Chapter 2 Introduction to Data Mining - ppt download


Notes-C-Language Pages 1 - 16 - Flip PDF Download

Notes-C-Language Pages 1 - 16 - Flip PDF Download


Lecture Notes for Chapter 2 Introduction to Data Mining - ppt

Lecture Notes for Chapter 2 Introduction to Data Mining - ppt


DATA_MINING_CONCEPTS_AND_TECHNIQUES_3RDpdf - Jiawei Han[DATA

DATA_MINING_CONCEPTS_AND_TECHNIQUES_3RDpdf - Jiawei Han[DATA


Data Mining-Spatial Data Mining

Data Mining-Spatial Data Mining


Types of Data: Nominal  Ordinal  Interval/Ratio - Statistics Help

Types of Data: Nominal Ordinal Interval/Ratio - Statistics Help


A dissimilarity measure for mixed nominal and ordinal attribute

A dissimilarity measure for mixed nominal and ordinal attribute


Notes-C-Language Pages 1 - 16 - Flip PDF Download

Notes-C-Language Pages 1 - 16 - Flip PDF Download


PDF) Ordinal  Continuous and Heterogeneous k -Anonymity Through

PDF) Ordinal Continuous and Heterogeneous k -Anonymity Through


Ordinal and Multinomial Models

Ordinal and Multinomial Models


IEEE 2010 IEEE 10th International Conference on Data Mining (ICDM

IEEE 2010 IEEE 10th International Conference on Data Mining (ICDM


42 Attribute Measurement Scales

42 Attribute Measurement Scales


Continuize — Orange Visual Programming 3 documentation

Continuize — Orange Visual Programming 3 documentation


Understanding Data Attribute Types

Understanding Data Attribute Types


Nominal VS Ordinal Data: Definition  Examples and Difference

Nominal VS Ordinal Data: Definition Examples and Difference


DOC) RO DATAMINING

DOC) RO DATAMINING


A dissimilarity measure for mixed nominal and ordinal attribute

A dissimilarity measure for mixed nominal and ordinal attribute


Nominal  Ordinal  Interval  Ratio Scales with Examples

Nominal Ordinal Interval Ratio Scales with Examples


PDF] A Simple Approach to Ordinal Classification

PDF] A Simple Approach to Ordinal Classification


CSE 634 Data Mining Techniques Professor Anita Wasilewska

CSE 634 Data Mining Techniques Professor Anita Wasilewska


PDF) Data Mining: Data

PDF) Data Mining: Data


Nominal vs Ordinal Scale: What is the Difference?

Nominal vs Ordinal Scale: What is the Difference?


ADDM Practical File[19203009]pdf - 1 Advanced Databases and Data

ADDM Practical File[19203009]pdf - 1 Advanced Databases and Data


Understanding Data Attribute Types

Understanding Data Attribute Types


Scales of Measurement - Nominal  Ordinal  Interval  Ratio (Part 1

Scales of Measurement - Nominal Ordinal Interval Ratio (Part 1


Simple Trick to Train an Ordinal Regression with any Classifier

Simple Trick to Train an Ordinal Regression with any Classifier


https://machinelearningmasterycom/transform-machine-learning-data-weka/

https://machinelearningmasterycom/transform-machine-learning-data-weka/


Nominal VS Ordinal Data: Definition  Examples and Difference

Nominal VS Ordinal Data: Definition Examples and Difference


Data mining :Concepts and Techniques Chapter 2  data

Data mining :Concepts and Techniques Chapter 2 data


Lecture Notes for Chapter 2 Introduction to Data Mining - ppt

Lecture Notes for Chapter 2 Introduction to Data Mining - ppt


book-20160121pdf - DATA MINING AND ANALYSIS Fundamental Concepts

book-20160121pdf - DATA MINING AND ANALYSIS Fundamental Concepts

Politique de confidentialité -Privacy policy