When collecting results from a workflow, you may want to get back only the rows that match criteria you specify. For example, if you're scraping Twitter users, you may want only the rows for users whose follower count falls within a certain range.
To do this, navigate to the workflow you want to filter, scroll down to "Extractors", and click the extractor you want to apply the filter to (e.g. Twitter User Followers):
Then click the "Edit" button on the extractor screen and scroll down to "Filtering", where you can specify your criteria:
The "Output Field" must be the name of the column you want to filter on, exactly as it appears in the CSV export. You'll typically want to run the workflow once without the filter, then inspect the output file to find that column name. In this case, the workflow outputs this file (without the filter):
Here, we see that the follower count is in the users.followers_count column of the original output, so that is what we enter as the "Output Field" when declaring the filter. Enter it, then select "Within Range" as the filter type and enter a lower and an upper bound for the filter:
After entering this information, scroll down and save your changes. You can confirm that the filter was saved because the extractor will now look like this (showing the filter definition):
That's it! Now when you run the workflow again, it will only return rows for Twitter users with at least 100 followers but no more than 999.
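If you'd like to preview which rows a "Within Range" filter would keep before saving it, you can approximate it locally against an unfiltered export. The Python sketch below is only an illustration, not how the product implements filtering: the `within_range` helper is hypothetical, the sample rows and the `users.screen_name` column are made up, and it assumes both bounds are inclusive. Only the `users.followers_count` column name comes from the example above.

```python
import csv
import io


def within_range(rows, field, lower, upper):
    """Keep rows whose `field` is a number between lower and upper (inclusive, by assumption)."""
    kept = []
    for row in rows:
        try:
            value = float(row[field])
        except (KeyError, TypeError, ValueError):
            continue  # skip rows where the field is missing or not numeric
        if lower <= value <= upper:
            kept.append(row)
    return kept


# Stand-in for an unfiltered CSV export (made-up rows for illustration).
# For a real file, replace io.StringIO(sample_csv) with open(path, newline="").
sample_csv = """users.screen_name,users.followers_count
alice,42
bob,100
carol,512
dave,999
erin,1000
"""

reader = csv.DictReader(io.StringIO(sample_csv))

# The header row lists the column names you can use as the "Output Field".
print(reader.fieldnames)

rows = list(reader)
kept = within_range(rows, "users.followers_count", 100, 999)
print([row["users.screen_name"] for row in kept])
```

Comparing the number of rows kept here with the row count of the filtered workflow run is a quick way to check that the output field and bounds were entered as intended.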