Chapter 27: Problem 16

For the Partition algorithm, prove that any frequent itemset in the database must appear as a local frequent itemset in at least one partition.

Short Answer

Expert verified

According to Partition Algorithm principle, any frequent itemset in the whole database should appear as a local frequent itemset in at least one partition because the database is a superset of all partitions, and each transaction in the database belongs to exactly one partition. Therefore, if an itemset is frequent in the database, it must be frequent in at least one of the partitions.

Step by step solution

Understanding the Problem Terminology

An itemset is considered 'frequent' if its support (probability of occurrence) in the dataset is no less than a predetermined minimum support threshold. A 'local frequent itemset' is a frequent itemset in a partition of the data.

Partition Algorithm Basic Principle

The basic principle of the Partition algorithm is that it scans the entire dataset once and divides it into non-overlapping subsets or partitions. Each partition fits into main memory, allowing for local frequent itemsets to be generated. Given the constraint of the Partition algorithm, each transaction in the database belongs to exactly one partition.

Proving the Assertion

It is stated that every frequent itemset in the entire database must exist as a local frequent itemset in at least one partition. This holds true because if an itemset is 'frequent' in the entire database, it means its support is above the minimum support threshold in the full data. Since the database is a superset of all partitions, but each transaction can belong to only one partition, that itemset has to appear frequently in at least one partition. Thus, if an itemset does not appear as a local frequent itemset in any partition, it cannot be a frequent itemset in the entire database.

Counter-Argument for a Clearer Picture

To affirm the understanding, consider a counterargument. Assume that a frequent itemset does not appear as a local frequent itemset in any partition. But because the frequent itemset in the full set is constituted by transactions from some (or all) partitions, the absence of this itemset in any partition means it's not frequent in the database. This contradicts the initial assumption, hence the counterargument confirms our theory.

Unlock Step-by-Step Solutions & Ace Your Exams!

Full Textbook Solutions
Get detailed explanations and key concepts
Unlimited Al creation
Al flashcards, explanations, exams and more...
Ads-free access
To over 500 millions flashcards
Money-back guarantee
We refund you if you fail your exam.

Start your free trial

Over 30 million students worldwide already upgrade their learning with Vaia!

Recommended explanations on Computer Science Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

For the Partition algorithm, prove that any frequent itemset in the database must appear as a local frequent itemset in at least one partition.

Short Answer

Step by step solution

Understanding the Problem Terminology

Partition Algorithm Basic Principle

Proving the Assertion

Counter-Argument for a Clearer Picture

One App. One Place for Learning.

Most popular questions from this chapter

Recommended explanations on Computer Science Textbooks

Databases

Functional Programming

Computer Programming

Data Representation in Computer Science

Problem Solving Techniques

Computer Organisation and Architecture

Study anywhere. Anytime. Across all devices.