Skip to main content

Varicent ELT Help Center

Binning

Abstract

Group numeric values into categories.

Use the Binning tool to group and organize a column into categories. The Binning tool works with numeric values.

There are two ways to divide columns: quantile and interval. The Auto button automatically groups your data based on the number of specified bins. To define the bins manually, toggle the Auto button off.

Here's how the different options work:

  • Quantile. Equally, divide values into bins. Each bin has an equal number of records, but the value range in each bin is different. This option returns the bin and the bin midpoint.

  • Interval. Equally, divide a range of values into bins. The number of records in each bin varies, but the value range in each bin is the same. This option returns the bin and the bin midpoint.

Input

The Binning tool requires one data input.

Configuration

Use the following configuration options to help create your Binning configuration.

Configuring the Binning tool
  1. Go to the Pipes module from the side navigation bar.

  2. From the Pipes tab, click an existing pipe to open, or create a new pipe. To create a new pipe, read the Creating a pipe documentation.

  3. Locate the pipe you want to work with and click to open it.

  4. In the Pipe builder, add at least one data source to your pipe. For more information on adding a data source, see the Data tool.

  5. Click symon_add_icon.png + Tool.

    The Tools modal opens where you can add tools, such as the Aggregate tool to your pipe.

  6. In the Tools modal, search for the  Binning tool and then click + Add tool.

    Tip

    You can also find the Binning tool in the Organize section.

  7. Click the tool node and drag the line to the next tool to connect the tools. If you need to undo the action, click the line and then click Unlink.

  8. In the configuration pane, enter the following information:

    Table 80. Binning tool configuration

    Field

    Description

    Quantile or Interval

    Click to select the type of binning to divide the columns, either Quantile or Interval.

    • Quantile equally divides values into bins. Each bin has an equal number of records, but the value range in each bin is different. Returns the bin and the bin midpoint.

    • Interval equally divides a range of values into bins. The number of records in each bin varies, but the value range in each bin is the same. Returns the bin and the bin midpoint.

    Auto toggle

    Switch off to manually define the bins. Keep the toggle on to have the Binning tool automatically define the bins.

    Target column

    Select the column from your data set to organize into categories.

    Grouped columns

    Select the columns to group into a single column.

    New column name

    Enter the new name for your column.

    For example, if you are trying to find the quarterly sales figures for each of your sellers, you could name the column, Quarterly sales.

    Number of bins

    Enter the amount of bins that you need, or click the arrows to increase or decrease the number.



Usage example

Let's say you want to find out who your top and lower sellers. Using the Binning tool, you can group sellers into categories based on their sales revenue.

Using the quantile method
  1. In the Binning configuration pane, let's configure the following values:

    1. Click the Quantile button. Turn the Auto toggle switch off.

    2. In the Target column field, select SellerID.

    3. In the New column name field, enter Binned.

    4. In the Number of bins field, enter 2.

    5. In the Between fields, enter 50 in the first field and 50 in the second field.

  2. Build the pipe.

BinningExampleQuantile.png

The results show the sellers are grouped into two categories, >50 and <=50. This shows the sellers who are above 50% in sales, and those below 50% sales. You can now see who your top sellers are.