Limit search to available items
Book Cover
E-book
Author Adhikari, Animesh

Title Data analysis and pattern recognition in multiple databases / by Animesh Adhikari, Jhimli Adhikari, Witold Pedrycz
Published Cham : Springer, [2014]
©2014
Table of Contents
1.Introduction1
1.1.Motivation1
1.2.Distributed Data Mining3
1.3.Multi-Database Mining Approaches4
1.3.1.Local Pattern Analysis5
1.3.2.Sampling6
1.3.3.Re-Mining6
1.4.Pre-Processing of Databases6
1.4.1.Preparation of Data Warehouses7
1.4.2.Temporal Aggregation8
1.4.3.Partitioning Database8
1.4.4.Database Thinning9
1.4.5.Ordering of Databases9
1.4.6.Selection of Databases9
1.5.Patterns and Associations10
1.6.Related Studies13
1.7.Experimental Settings14
1.8.Conclusions14
 References16
2.Synthesizing Different Extreme Association Rules from Multiple Databases21
2.1.Introduction21
2.2.Some Extreme Types of Association Rule in Multiple Databases23
2.3.Problem Statement25
2.4.An Extended Model of Local Pattern Analysis for Synthesizing Global Patterns25
2.5.Related Work27
2.6.Synthesizing an Association Rule29
2.6.1.Design of Algorithm30
2.6.2.Error Calculation34
2.7.Experiments35
2.7.1.Results of Experimental Studies37
2.7.2.Comparison with Existing Algorithm38
2.8.Conclusions40
 References41
3.Clustering Items in Time-Stamped Databases Induced by Stability43
3.1.Introduction43
3.2.Related Work44
3.3.A Model of Mining Multiple Transactional Time-Stamped Databases45
3.4.Problem Statement47
3.5.Clustering Items49
3.5.1.Finding the Best Non-Trivial Partition50
3.5.2.Finding a Best Class54
3.6.Experiments55
3.7.Conclusions58
 References59
4.Synthesizing Global Patterns in Multiple Large Data Sources61
4.1.Introduction61
4.2.Multi-Database Mining Using Local Pattern Analysis62
4.3.Generalized Multi-Database Mining Techniques63
4.3.1.Local Pattern Analysis63
4.3.2.Partition Algorithm64
4.3.3.IdentifyExPattern Algorithm64
4.3.4.RuleSynthesizing Algorithm64
4.4.Specialized Multi-Database Mining Techniques65
4.4.1.Mining Multiple Real Databases65
4.4.2.Mining Multiple Databases for the Purpose of Studying a Set of Items66
4.4.3.Study of Temporal Patterns in Multiple Databases66
4.5.Mining Multiple Databases Using Pipelined Feedback Technique66
4.5.1.Algorithm Design68
4.6.Error Evaluation68
4.7.Experiments69
4.8.Conclusions73
 References73
5.Clustering Local Frequency Items in Multiple Data Sources75
5.1.Introduction75
5.2.Related Work77
5.2.1.Measures of Association77
5.2.2.Multi-Database Mining Techniques78
5.2.3.Clustering Techniques81
5.3.Problem Statement82
5.4.Synthesizing Support of an Itemset83
5.5.Clustering Local Frequency Items86
5.5.1.Finding the. Best Non-Trivial Partition89
5.5.2.Error Analysis93
5.6.Experimental Results94
5.6.1.Overall Output96
5.6.2.Synthesis of High Frequency Itemsets97
5.6.3.Error Quantification99
5.6.4.Average Error Versus γ100
5.6.5.Average Error Versus α101
5.6.6.Clustering Time Versus Number of Databases103
5.6.7.Comparison with Existing Technique104
5.7.Conclusions106
 References107
6.Mining Patterns of Select Items in Different Data Sources109
6.1.Introduction109
6.2.Mining Global Patterns of Select Items111
6.3.Overall Association Between Two Items in a Database113
6.4.An Application: Study of Select Items in Multiple Databases Through Grouping116
6.4.1.Properties of Different Measures118
6.4.2.Grouping Frequent Items120
6.4.3.Experiments123
6.5.Related Work127
6.6.Conclusions128
 References128
7.Synthesizing Global Exceptional Patterns in Different Data Sources131
7.1.Introduction131
7.2.Exceptional Patterns in Multiple Data Sources133
7.3.Problem Statement139
7.4.Related Work139
7.5.Synthesizing Support of an Itemset140
7.6.Synthesizing Type II Global Exceptional Itemsets141
7.7.Error Calculation145
7.8.Experiments146
7.8.1.Comparison with the Existing Algorithm151
7.9.Conclusions153
 References153
8.Mining Icebergs in Different Time-Stamped Data Sources157
8.1.Introduction157
8.2.Related Work159
8.3.Notches in Sales Series160
8.3.1.Identifying Notches162
8.4.Generalized Notch162
8.5.Iceberg Notch163
8.6.Sales Series164
8.6.1.Another View at Sales Series164
8.6.2.Other Applications of Icebergs165
8.7.Mining Icebergs in Time-Stamped Databases166
8.7.1.Non-Incremental Approach166
8.7.2.Incremental Approach169
8.8.Significant Year171
8.9.Experimental Studies172
8.10.Conclusions180
 References180
9.Mining Calendar-Based Periodic Patterns in Time-Stamped Data183
9.1.Introduction183
9.2.Related Work186
9.3.Calendar-Based Periodic Patterns187
9.3.1.Extending Certainty Factor188
9.3.2.Extending Certainty Factor with Respect to Other Intervals191
9.4.Mining Calendar-Based Periodic Patterns192
9.4.1.Improving Mining Calendar-Based Periodic Patterns193
9.4.2.Data Structure193
9.4.3.A Modified Algorithm195
9.5.Experimental Studies198
9.5.1.Selection of Mininterval and Maxgap202
9.5.2.Selection of Minsupp204
9.5.3.Performance Analysis205
9.6.Conclusions207
 References207
10.Measuring Influence of an Item in Time-Stamped Databases209
10.1.Introduction209
10.2.Association Between Two Itemsets211
10.3.Concept of Influence212
10.3.1.Influence of an Itemset on Another Itemset213
10.3.2.Properties of Influence Measures214
10.3.3.Influence of an Item on a Set of Specific Items215
10.3.4.Motivation216
10.4.Problem Statement218
10.5.Related Work219
10.6.Design of Algorithms219
10.6.1.Designing Algorithm for Measuring Overall Influence of an Item on Another Item220
10.6.2.Designing Algorithm for Measuring Overall Influence of an Item on Each of the Specific Items221
10.6.3.Designing Algorithm for Identifying Top Influential Items on a Set of Specific Items221
10.7.Experiments222
10.8.Conclusions228
 References228
11.Summary and Conclusions231
11.1.Changing Scenarios231
11.2.Summary of Chapters232
11.3.Selected Open Problems and Challenges234
11.4.Conclusions234
 References235
 Index237

Copies

Description 1 online resource (247 pages) : illustrations
Series Intelligent Systems Reference Library, 1868-4394 ; 61
Intelligent systems reference library ; v. 61.
Contents From the Contents: Synthesizing Different Extreme Association Rules in Multiple Data Sources -- Clustering items in time-stamped databases induced by stability -- Mining global patterns in multiple large databases -- Clustering Local Frequency Items in Multiple Data Sources -- Mining Patterns of Select Items in Different Data Sources
Summary Pattern recognition in data is a well known classical problem that falls under the ambit of data analysis. As we need to handle different data, the nature of patterns, their recognition and the types of data analyses are bound to change. Since the number of data collection channels increases in the recent time and becomes more diversified, many real-world data mining tasks can easily acquire multiple databases from various sources. In these cases, data mining becomes more challenging for several essential reasons. We may encounter sensitive data originating from different sources - those cannot be amalgamated. Even if we are allowed to place different data together, we are certainly not able to analyse them when local identities of patterns are required to be retained. Thus, pattern recognition in multiple databases gives rise to a suite of new, challenging problems different from those encountered before. Association rule mining, global pattern discovery, and mining patterns of select items provide different patterns discovery techniques in multiple data sources. Some interesting item-based data analyses are also covered in this book. Interesting patterns, such as exceptional patterns, icebergs and periodic patterns have been recently reported. The book presents a thorough influence analysis between items in time-stamped databases. The recent research on mining multiple related databases is covered while some previous contributions to the area are highlighted and contrasted with the most recent developments
Bibliography Includes bibliographical references and index
Notes English
Online resource; title from PDF title page (ebrary, viewed January 8, 2014)
Subject Data mining.
Engineering.
Optical pattern recognition.
Data Mining
Engineering
engineering.
COMPUTERS -- General.
Ingénierie.
Data mining
Engineering
Optical pattern recognition
Form Electronic book
Author Adhikari, Jhimli
Pedrycz, Witold, 1953-
ISBN 9783319034102
3319034103
331903409X
9783319034096