Your browser doesn't support javascript.
SCORER-Gap: Sequentially Correlated Rules for Event Recommendation Considering Gap Size
21st IEEE International Conference on Data Mining Workshops, ICDMW 2021 ; 2021-December:925-934, 2021.
Article in English | Scopus | ID: covidwho-1730939
ABSTRACT
Finding significant events, which follow a specific pattern, is an essential task in sequential rule mining. While the significance of a rule often is based on conditions like a maximum amount of time [1], or a minimum distance between patterns [2], the area between these two extremes is rarely analyzed. This paper aims at the discovery of partially-ordered sequential rules which satisfy a given correlation gap constraint. Applying this constraint to the support threshold determines a more relevant rule, among other parameters. We also require it in sparse datasets, where long sequences with many distinct events exist. This setting can be found in online product configurators, where the basis is an unstructured process that combines both high-level and fine-grained configuration steps. In general, our novel approach SCORER-Gap can be applied to procedures with a high variability of events.By focusing on the gap size between antecedent and consequent of a rule, we show that usually, the resulting vast number of rules gets highly reduced while keeping the flexibility between a minimum and a maximum distance in between. To implement our novel approach, we use an in-mining setup, namely RuleGrowth [1] to which we attach the correlation gap constraint as mentioned above. The code is available on [3]. For an extensive analysis of application areas, we use three real-world datasets consisting of different characteristics. We start with a Covid19 genome sequence representing a highly dense dataset. Additionally, an industrial database and the clickstream of a Hungarian news website (Kosarak) are used as representatives for increasingly sparse datasets.SCORER-Gap shows a high percentual reduction in the number of rules in the resulting ruleset while slightly increasing accuracy in a train and test setting. Furthermore, a high proportion of recommendation rules differs between RuleGrowth and SCORER-Gap. © 2021 IEEE.
Keywords

Full text: Available Collection: Databases of international organizations Database: Scopus Type of study: Prognostic study Language: English Journal: 21st IEEE International Conference on Data Mining Workshops, ICDMW 2021 Year: 2021 Document Type: Article

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: Databases of international organizations Database: Scopus Type of study: Prognostic study Language: English Journal: 21st IEEE International Conference on Data Mining Workshops, ICDMW 2021 Year: 2021 Document Type: Article