Create a matcher

A Matcher automatically detect duplicates records and group them into clusters.

Only one matcher can be created for each entity.

Create a matcher

To create a matcher:

  1. Expand the entity node, right-click the Matcher node and select Define SemQL Matcher…. The Create New SemQL Matcher wizard opens.

  2. In the Description field, optionally enter a description for the Matcher.

  3. Click Finish to close the wizard. The SemQL Matcher editor opens.

Define the match rules

To define the match rules:

  1. In the matcher, scroll to the Match Rules table.

  2. Click the Add Match Rule button in the Match Rules tables. The Match Rule: NewRule editor opens.

  3. Give a Name and internal Description (intended for model designers) to the match rule.

  4. Give a user-friendly Label and Documentation. In the documentation, use plain text or the Markdown syntax for rich text. This text provides detailed documentation for the rule. It appears in the documentation side nav.

  5. Set a Color for the match rule. This color is used, in the match graphs, for the edges linking records matched by this rule. Note that if you do not set a color, a random color will be assigned to the rule.

  6. If you want to use matching on child records, check the Match on Child Records option, and select the Child Records you want to use for matching.

  7. Define the Binning Expressions:

    1. In the Binning Expressions table, click the Add Binning Expression button. The SemQL editor opens.

    2. Create a SemQL expression used to bin records for this entity, and then click OK to close the SemQL Editor.

    3. Repeat the previous steps to create all your binning expressions.

  8. Define the Match Condition and Match Score:

    1. In the Matching section, click the edit expression button Edit Expression button. The SemQL editor opens.

    2. Create a SemQL condition used to match records for this entity, and then click OK to close the SemQL Editor.

    3. Enter a value for the Match Score. This value should be between 1 and 100.

  9. Press Control+S (or Command+S on macOS) to save the editor.

  10. Use the breadcrumb on top of the editor to return to the matcher. The new match rule appears in the list.

  11. Repeat the previous steps to create all the match rules.

  12. Press Control+S (or Command+S on macOS) to save the editor.

Configure grouping and scoring

To configure grouping and scoring:

  1. In the matcher, scroll to the Grouping and Scoring section.

  2. Select whether to Use Transitive Match Score. See direct and transitive scoring for more information.

  3. Select whether to use Multi-Iteration Grouping. See multi-iterating grouping for more information.

  4. Press Control+S (or Command+S on macOS) to save the editor.

Configure the merge and auto-confirm policies

To configure the merge and auto-confirm policies:

  1. In the matcher, define the Merge Policy: Set the minimum confidence score required for a match group to be merged in the various merge cases.

  2. Define the Auto-Confirm Policy:

    1. Auto-confirm golden records: Minimum confidence score required for a match group to be automatically confirmed.

    2. Auto-confirm singletons: Select this option to have singletons (un-matched records) automatically confirmed.

  3. Press Control+S (or Command+S on macOS) to save the editor.