How do I find Duplicate Records in my Database?
Overview
Invenias includes a powerful duplicate checking mechanism which is designed to prevent duplicate records being created. Each time a Record is created in Invenias through a parsing action, or by manually creating a record Invenias will check the database for possible matches. If any potential matches are found, these are suggested to the User and they can choose to match to the existing record or continue and create a new record.
There is however always a possibility that duplicate records will be created which presents an ongoing problem with data quality and difficulties when trying to locate known records in the database. This problem becomes more of an issue with larger databases as the duplicates can be very difficult to identify among many thousands or even millions of records.
The attached reports can be used to help identify possible duplicate records in your database, allowing a User to then view and either merge, delete or ignore the potential duplicates.
The Invenias Professional Services team are experienced in offering data cleansing as a service. We can analyse your database and provide guidance on setting rules to identify and remove records which do not offer value to retain. For more information, please contact inveniassupport@bullhorn.com.
Click here to view a guide on how to import the reports which are discussed below and attached at the end of this article.
This article covers:
- Company Duplicates Basic Report
- Company Duplicates Extended Report
- People Duplicates Report
- Downloads
Company Duplicates Basic Report
This report checks for possible duplicates on People records using a complex combination of queries based on matches on various fields including: Email, Name/Job Title/Company Name, Mobile Phone.
The Records flagged as possible duplicates are grouped together and show basic information from the records. You can open the Records using the links in the report and then delete or merge as you wish.
An example report is shown below:
Company Duplicates Extended Report
As above but this reports excludes accent characters AND noisewords in Company names.
An example report is shown below. As Invenias LTD contains the noiseword LTD, this report has flagged it as a potential duplicate of "Invenias", as after removing the noiseword it is a match on name:
People Duplicates Report
This report checks for possible duplicates on People records using a complex combination of queries based on matches on various fields including: Email, Name/Job Title/Company Name, Mobile Phone.
The Records flagged as possible duplicates are grouped together and show basic information from the records. You can open the Records using the links in the report and then delete or merge as you wish.
An example report is shown below: