Skip to main content

UIC-Miner Task

The UIC-Miner (Uplift in Categories Miner) finds conditions under which rare categories in a target attribute are proportionally boosted. It is designed for situations where a valuable but infrequent outcome — such as a fatal accident, a high-value sale, or a rare disease — needs to be identified and amplified despite its low base occurrence.

The core idea: rather than searching for conditions where a rare category appears alone (which may produce too few results to be actionable), UIC-Miner mixes the rare category with a related, more common one and searches for conditions where the combined weighted improvement is above a defined threshold.

Task creation is handled through a three-step wizard: Task Setup → Logic Configuration → Quantifiers.


Step 1 — Task Setup

UIC-Miner Step 1 - Task Setup

The first step captures the basic task information:

FieldDescription
Task NameA name to identify this task
Procedure MethodSelect UICMiner
DatasetThe dataset to mine — select from your uploaded datasets
ProjectOptionally assign the task to a project (can be left empty)

Click Next Step to proceed to logic configuration.


Step 2 — Logic Configuration

UIC-Miner Step 2 - Logic Configuration

The UIC-Miner has two cedent tabs — Antecedent (IF) and Condition (Filter) — plus a Target Attribute selector shared with the CF-Miner.

Target Attribute

Select the categorical column whose category distribution you want to analyse and boost. The categories of this attribute are what the AAD Weights quantifier in Step 3 will be applied to.

Antecedent (IF)

The antecedent defines the conditions the miner explores. For each antecedent found, the miner evaluates the histogram of the target attribute within that subgroup and compares it to the baseline (either the full dataset or the condition subset).

Configure it the same way as any other cedent:

  • Cedent Type — toggle between Conjunction (AND) or Disjunction (OR) using the Switch Type button
  • Cedent Length (Min / Max) — controls how many attributes can be combined in a single antecedent

For each attribute added:

FieldDescription
ColumnSelect an attribute from the dataset
TypeHow the attribute's values are grouped — see Literal Types below
Min / MaxThe minimum and maximum number of values to combine for this attribute

Condition (Filter)

An optional filter that restricts the dataset to a subset before mining begins. When set, the antecedent's histogram is compared against the condition subset rather than the full dataset.

Literal Types

TypeDescription
subsetAny subset of the attribute's categories (unordered)
seqSequences of consecutive ordered values
lcutLeft cut — takes values from the left end of the ordered range
rcutRight cut — takes values from the right end of the ordered range

Click Next Step to proceed to quantifier setup.


Step 3 — Quantifiers

UIC-Miner Step 3 - Quantifiers

UIC-Miner uses a weighted scoring mechanism to rank how interesting a discovered antecedent is. Only rules meeting all specified conditions are returned.

Base

QuantifierDescription
BaseMinimum number of records satisfying the antecedent
Relative BaseMinimum base as a fraction of the total dataset size

Confidence (AAD Score)

QuantifierDescription
AAD ScoreMinimum weighted improvement score. Measures the overall uplift across all target categories, weighted by importance. Higher means more interesting rules.
AAD WeightsA comma-separated weight vector — one value per target category, in order. Categories with higher weights contribute more to the score. Set to 0 to exclude a category from scoring.

How AAD Weights work

The weights tell the miner which categories matter most. For example, with a target of Severity having categories Fatal, Serious, Slight in that order:

  • 5, 1, 0 — Fatal is 5x more important than Serious, Slight is ignored
  • 1, 1, 0 — Fatal and Serious matter equally, Slight is ignored
  • 5, 0, 0 — only boost in Fatal counts

The miner computes the per-category improvement (how much the antecedent raises the occurrence of each category relative to the baseline), multiplies each by its weight, and sums them into a single aad_score. Only rules exceeding the AAD Score threshold are included in the results.

Submitting the Task

At the bottom of the quantifiers step, two actions are available:

  • Save Task — saves the task configuration for later execution
  • Run Task — saves the task and immediately dispatches it to the execution pipeline
tip

Start with a moderate AAD Score threshold (e.g. 20) and assign the highest weight to your rarest but most valuable category. You can always raise the threshold to tighten the result set after seeing the initial output.