On the Search options tab you can define the settings for subset fields as well as for the duplicate check threshold value, and activate the duplicate check when importing addresses.
The duplicate check compares specific addresses against other specific addresses. This means that the whole address data stock is divided into subsets in which each address is checked against all the other addresses in the subset.
In the Subset fields area, you can define how many characters a field value can have for certain fields as this helps to limit the amount of data compared. In the drop-down list in the Subset fields area, you will see the fields for which you can limit the number of characters, this helps reduce the amount of data being compared.
You can enter values from 0 to 5. If you set the value to 0, then the fields will not be included in the subset. The higher the selected value, the more subsets are created and the more effective the duplicate search is. If, for example, you form subsets based on the first 2 characters of a postal code, then more addresses will be included in the subset as taking the first 3 characters of the postal code which limits the amount of data to be compared.
In the threshold area you can define a percentage value for address matching which is used to discern duplicates.
In the Duplicate checking while importing area, you can define whether a duplicate should run during an address import. Any duplicates found are highlighted, these can be merged by users if they have sufficient rights to do so in the CAS genesisWorld Desktop Client.
There are a few specifics you should know about if you have enabled the duplicate check when importing address data.
In the Additional settings area you can define the following settings.
When calculating the probability of whether a duplicate exists, the fields selected for the duplicate check are checked on a field level. If the option is active, then any empty fields in an address data record are ignored if another address which is compared to the first contains a value in the same field.