Data Cleaning

Description: Filter out less relevant contacts and attach messaging

Platform: Parabola

Schedule/Trigger: 8

1. Merge Parvenu Verifier 1-10

In the last step we split the results of the email verification workflow into ten separate Sheets for error handling, so that if one part fails, the other nine still succeed and the failed one is retried later. Now that the data has passed verification, we begin this workflow by using Stack Tables on Verifiers 1-10 to combine them into a single table. Then we use a second Stack Tables step to merge the verifier table with the Parvenu Data Cleaned Sheet, which is also the Sheet this workflow writes its final output to. By stacking the destination Sheet with the new data from the verifier Sheets at the beginning of the workflow, we re-apply the current filters to all rows, old and new, each time the workflow runs.
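
As a rough illustration of what the two stacking steps do, here is a minimal Python sketch. It is not Parabola's implementation: the `stack_tables` helper, the column names, and the sample rows are all hypothetical, and only two of the ten verifier tables are shown.

```python
def stack_tables(*tables):
    """Append tables (lists of row dicts) one under another.

    Columns missing from a table are filled with an empty string,
    which is an assumption about how stacking should behave.
    """
    columns = []
    for table in tables:
        for row in table:
            for key in row:
                if key not in columns:
                    columns.append(key)
    return [
        {col: row.get(col, "") for col in columns}
        for table in tables
        for row in table
    ]


# Hypothetical sample data standing in for the verifier Sheets.
verifier_1 = [{"Email": "a@example.com", "Status": "valid"}]
verifier_2 = [{"Email": "b@example.com", "Status": "valid"}]
# Hypothetical rows already present in the destination Sheet.
cleaned_sheet = [{"Email": "c@example.org", "Status": "valid", "Filter ID": "1"}]

# First stack: combine the verifier tables into one.
verified = stack_tables(verifier_1, verifier_2)
# Second stack: merge with the existing destination Sheet.
combined = stack_tables(verified, cleaned_sheet)
```

Because the destination rows are stacked in before any filtering happens, every later step sees old and new rows alike.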

Next we import the "Filters" tab and the "Linkedin Messaging" tab of the Session Cookie Rotation Sheet and Join the main table with both by Filter ID. Then we use a Text Casing step on Company, Title, Title Not Contains, and Company Not Contains.
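
The join and casing steps could be sketched in Python as follows. Several things here are assumptions rather than details from the workflow: the join is treated as a left join, the casing step is assumed to lowercase the values (the text does not say which casing is used), and the `Message` column and sample rows are hypothetical.

```python
def left_join(left, right, key):
    """Attach columns from the matching `right` row to each `left` row."""
    lookup = {row[key]: row for row in right}
    joined = []
    for row in left:
        merged = dict(row)
        for col, value in lookup.get(row.get(key), {}).items():
            merged.setdefault(col, value)  # never overwrite main-table values
        joined.append(merged)
    return joined


def lowercase_columns(rows, columns):
    """Lowercase the given columns (assumed casing; the source does not specify)."""
    return [
        {c: (v.lower() if c in columns and isinstance(v, str) else v)
         for c, v in row.items()}
        for row in rows
    ]


# Hypothetical sample rows.
contacts = [{"Email": "a@example.com", "Title": "VP Sales",
             "Company": "Acme", "Filter ID": "1"}]
filters = [{"Filter ID": "1", "Title Not Contains": "Intern, Student",
            "Company Not Contains": "Recruiting"}]
messaging = [{"Filter ID": "1", "Message": "Hi there"}]

joined = left_join(left_join(contacts, filters, "Filter ID"),
                   messaging, "Filter ID")
cleaned = lowercase_columns(
    joined, {"Company", "Title", "Title Not Contains", "Company Not Contains"}
)
```

Giving the data columns and the filter columns the same casing presumably lets the later "contains" checks match regardless of capitalization.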

2. Filter by Title, Company, and Domain

We apply the same four-step workflow to each of the title, company, and domain filters.

  1. Split the "X Not Contains" column into additional columns using a ", " as a delimiter.

  2. Add filter columns for the total number of filters. For example, 3 filters for the domain would add WebsiteFilter1, WebsiteFilter2, and WebsiteFilter3 as columns.

  3. An If/Else step creates the "X Filter" column and sets it to the flag value "X" if the target field contains the value of any of the filter columns.

  4. Row Filter to remove rows with "X" in the "X Filter" column.

*In the column names above, "X" stands for Title, Company, or Website. The flag value "X" is a literal X.

For example, let's say we're filtering out .edu and .gov from the domain. This would be listed in the Website Not Contains column of the "Filters" tab of the Session Cookie Rotation Sheet as ".gov, .edu". Splitting the Website Not Contains column by ", " would give us two columns: WebsiteFilter1 with ".gov" and WebsiteFilter2 with ".edu".

We then use an Insert Static Columns step to add WebsiteFilter1, WebsiteFilter2, and WebsiteFilter3, while checking the box to not create the column if it already exists. Since in this example 1 and 2 already exist, this step would only create WebsiteFilter3.

Next we use the If/Else step to add the Domain Filter column: if Domain contains the value of WebsiteFilter1, and WebsiteFilter1 itself contains ".", the value is set to "X". We create the same rule for WebsiteFilter2 and WebsiteFilter3. To support more filters in the Google Sheet than this, we would need to edit this step and add a rule for each additional filter column.

In the case of WebsiteFilter1, it would add an "X" if the Domain contained .gov. In the case of WebsiteFilter2, it would add an "X" if the Domain contained .edu.
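
The four steps can be sketched end to end for the domain example in Python. This is only an illustration under stated assumptions, not the Parabola steps themselves: the helper names and sample rows are hypothetical. Note that the check that the filter value contains "." also skips empty filter columns, which would otherwise match every domain.

```python
def split_filters(row, source, prefix, delimiter=", "):
    """Step 1: split the 'Not Contains' cell into numbered filter columns."""
    out = dict(row)
    values = [v for v in row.get(source, "").split(delimiter) if v]
    for i, value in enumerate(values, start=1):
        out[f"{prefix}{i}"] = value
    return out


def add_missing_columns(row, prefix, count):
    """Step 2: add filter columns up to `count`, keeping existing ones."""
    out = dict(row)
    for i in range(1, count + 1):
        out.setdefault(f"{prefix}{i}", "")
    return out


def flag_row(row, target, prefix, count, flag_column):
    """Step 3: set the flag to 'X' if the target contains any filter value."""
    out = dict(row)
    hit = any(
        "." in out[f"{prefix}{i}"] and out[f"{prefix}{i}"] in out[target]
        for i in range(1, count + 1)
    )
    out[flag_column] = "X" if hit else ""
    return out


# Hypothetical sample rows; the filter text matches the example above.
rows = [
    {"Domain": "mit.edu", "Website Not Contains": ".gov, .edu"},
    {"Domain": "usa.gov", "Website Not Contains": ".gov, .edu"},
    {"Domain": "acme.com", "Website Not Contains": ".gov, .edu"},
]

flagged = []
for row in rows:
    row = split_filters(row, "Website Not Contains", "WebsiteFilter")
    row = add_missing_columns(row, "WebsiteFilter", 3)
    flagged.append(flag_row(row, "Domain", "WebsiteFilter", 3, "Domain Filter"))

# Step 4: keep only rows that were not flagged.
kept = [row for row in flagged if row["Domain Filter"] != "X"]
```

Here only the `acme.com` row survives, and it carries an empty `WebsiteFilter3` column created in step 2.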

We then repeat this process for the Title Not Contains and the Company Not Contains filters. The number of filters supported per filter type is listed below.

Filter            Amount Supported
Domain Filter     3
Title Filter      30
Company Filter    30

Then we use a Remove Columns step to remove the 63 extra columns we created (3 domain, 30 title, and 30 company filter columns).

3. Assign Linkedin Connection Request Messaging

We start by using an If/Else step to create the Seniority Level column. This column standardizes the different variations of a title into a single seniority level. For example, three people who have "Vice President," "VP," and "V.P." in their Title would all be assigned the Seniority Level of VP.

Seniority = Linkedin Search Terms for Seniority

Seniority Level = Standardized seniority for variations of same title
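
A minimal Python sketch of the Seniority Level rule follows. Only the VP variants come from the example above; the `SENIORITY_RULES` structure, the case-insensitive matching, and the empty default for unmatched titles are assumptions made for illustration.

```python
# Each rule maps a standardized level to the title variants that imply it.
# Only the VP rule is taken from the text; further rules would be added here.
SENIORITY_RULES = [
    ("VP", ["vice president", "vp", "v.p."]),
]


def seniority_level(title):
    """Return the standardized seniority for a title, or "" if none matches."""
    lowered = title.lower()
    for level, variants in SENIORITY_RULES:
        if any(variant in lowered for variant in variants):
            return level
    return ""  # assumed default when no rule matches


levels = [
    seniority_level("Vice President of Sales"),
    seniority_level("VP, Marketing"),
    seniority_level("V.P. Engineering"),
    seniority_level("Software Engineer"),
]
```

Plain substring matching is simple but can over-match short variants inside longer words, so a real rule set may need stricter conditions.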
