Merge & Fuzzy Merge
Combining two datasets into one
Last updated
Combining two datasets into one
Last updated
Beyond connecting a single dataset, you can create a new dataset by merging two or more datasets. Akkio supports both Exact Match and Fuzzy Match merging. Your data sources do not need to be in the same format or system; for example, you can have one from a CSV and another from Salesforce.
We will now merge two sets of housing data with a fuzzy merge. Select 'Merge' from the top left area of Prepare to open the guided process.
You can now select the Secondary Dataset to merge and the column to match. Two housing datasets are merged on the Sale Price column in the example below. You can select additional columns as well, such as Lot Area. Once the columns are selected, press 'Merge Datasets.'
For additional control, specifically of fuzzy merging, advanced options are available as part of the merge function.
By default, the matching sensitivity is “Exact Match Only,” which will only merge the datasets on exact column matches.
Alternatively, you can select “Fuzzy Match,” meaning the datasets will be merged, even if the column names aren’t identical.
The next setting is “Merge Type,” which allows you to select between these two options:
Keep all rows in the primary dataset.
Keep only rows that appear in both datasets.
Scored 0-1, this value demonstrates how similar the records are, with 1 being a perfect match and 0 being completely different.
The current UI only supports horizontal merging. Appending rows/ merging vertically is not currently an option within the UI. However, this can be configured through the API.
Exact Merge is available to Basic and Professional level plans. While Fuzzy Merge is only available to Professional.