Loading...
Please wait, while we are loading the content...
Similar Documents
Dynamic Miss-Counting Algorithms: Finding Implication and Similarity Rules with Confidence Pruning (1999)
| Content Provider | CiteSeerX |
|---|---|
| Author | Fujiwara, Shinji Ullman, Jeffrey D. Motwani, Rajeev |
| Description | Dynamic Miss-Counting (DMC) algorithms are proposed, which find all implication and similarity rules with confidence pruning but without support pruning. To handle data sets with a large number of columns, we propose dynamic pruning techniques that can be applied during data scanning. DMC counts the numbers of rows in which each pair of columns disagree instead of counting the number of hits. DMC deletes a candidate as soon as the number of misses exceeds the maximum number of misses allowed for that pair. We also propose several optimization techniques that reduce the required memory size significantly. We evaluated our algorithms by using 4 data sets, i.e., Web access logs, Web page-link graph, News documents, and a Dictionary. These data sets have between 74,000 and 700,000 items. Experiments show that DMC can find high-confidence rules for such a large data sets efficiently. 1 Introduction Finding implication and similarity rules are two of the most interesting issues in the data ... |
| File Format | |
| Language | English |
| Publisher Date | 1999-01-01 |
| Access Restriction | Open |
| Subject Keyword | Similarity Rule Web Access Log High-confidence Rule Support Pruning Required Memory Size Data Set Introduction Finding Implication Dynamic Pruning Technique Web Page-link Graph Dynamic Miss-counting Large Number Interesting Issue Maximum Number In ICDE Large Data Set News Document Several Optimization Technique Dynamic Miss-counting Algorithm |
| Content Type | Text |
| Resource Type | Article |