Sequential Pattern Mining
Group 8
Pham Van Hung Do Ba Huy Tran Vi Khang Le Dang Khoa
Do Phuc Kien Nguyen Duc Lap Le Tran Bao Loi
University of Information Technology
November 11, 2024
Group 8 (UIT) Sequential Pattern Mining November 11, 2024 1 / 44
Overview
1. What is Sequential Pattern Mining ?
2. Applications of Sequence Pattern Mining
3. Methods for Sequential Pattern Mining
4. Experimental Results
5. Demo
Group 8 (UIT) Sequential Pattern Mining November 11, 2024 2 / 44
What is Sequential Pattern Mining ?
Definition
Sequential Pattern Mining is a popular data mining task, introduced in 1994 by Agrawal
and Srikant.
Given a set of data sequences and support threshold, the problem is to find the complete
set of frequent subsequences
Group 8 (UIT) Sequential Pattern Mining November 11, 2024 3 / 44
Support threshold
Support Threshold: is the minimum support value required for an item set to be considered
significant or frequent in the analysis. Example: A sequence database has min sup = 2.
Sequence id Sequence
1(ab)ca
2(ab)bc
3bcd
4b(ab)c
acwith support is 3 =>a sequential pattern.
Group 8 (UIT) Sequential Pattern Mining November 11, 2024 4 / 44
Subsequence
Subsequence: Given α=a1a2...anand β=b1b2...bm,αcalled a subsequence of β,
denote as αwhen 1 j1<j2< ... < jnmsuch that a1bj1,a2bj2, ..., anbjn.
Example:
(ac)⟩⊆⟨(abc)
(ac) ⊆ ac
(a)(c) (ab)d(bc)
Group 8 (UIT) Sequential Pattern Mining November 11, 2024 5 / 44