Search…
⌃K
Links

Generator summary

The following table summarizes the available generators. It indicates whether each generator can be made consistent, can be linked, and is differentially private.
In the Consistency column, the table also indicates whether the generator can be made self-consistent only, or can be made either self-consistent or consistent with another column.
For generators that can be data-free, the Description column includes whether the generator is always data-free, or only data-free when consistency is disabled.
Generator
Description
Consistency
Linking
Differential Privacy
Address
Generates a random string to replace a specific part of a mailing address. Data-free if not consistent.
Yes - Self or other
Yes
Yes if not consistent
Uses deep neural networks for high-fidelity data mimicking. By default, not available.
No
No
No
Algebraic
Identifies the algebraic relationship between 3 or more numeric values (at least one non-integer) and generates new values to match.
No
Yes
No
Generates unique alphanumeric strings of the same length as the input.
Yes - Self
No
No
Within an array, replaces letters with random other letters, and numbers with random other numbers.
Yes - Self
No
No
Runs a selected generator on values that match a user-specified JSONPath.
--
--
--
Runs a selected generator on values that match a regular expression.
--
--
--
ASCII Key
Generates unique alpha-numeric strings based on any printable ASCII characters.
Yes - Self
No
No
Creates values at the same frequency as the values in the underlying data.
No
Yes
Configurable
Replaces letters with random other letters and numbers with random other numbers.
Yes - Self
No
No
Replaces characters randomly, but preserves formatting.
Yes - Implicitly consistent
No
No
Generates a random company name-like string. Data-free if not consistent.
Yes - Self or other
No
Yes if not consistent
Applies different generators to rows conditionally based on any value in the table.
No
No
No
Constant
Uses a single specified value to mask all values in the column. Data-free.
No
No
Yes
Generates a continuous distribution to fit the underlying data.
No
Yes
Configurable
Populates the column using the sum of the values in other columns.
No
No
No
CSV Mask
Masks a text column.
Parses the text as a row for which the columns are delimited by a specified character.
--
--
--
Selects from values you provide. Data-free if not consistent.
Yes - Self
No
Yes if not consistent
Truncates dates or timestamps to a specific date or time part.
No
No
No
Email
Scrambles characters in an email address.
Preserves the formatting and keeps the @ and ..
Yes - Self
No
No
Generates timestamps that fit an event distribution.
No
Yes
No
File Name
Scrambles characters in a file name.
Preserves the formatting and the file extension.
Yes - Self
No
No
Replaces all instances of the find string with the replace string.
No
No
No
Geo
Masks columns that contain latitude and longitude values.
No
No
No
Can be used to generate cities, states, zip codes, and latitude/longitude values that follow HIPAA guidelines for safe harbor.
Yes - Self
No
No
Hostname
Generates random host names, based on the English language. Data-free if not consistent.
Yes - Self or other
No
Yes if not consistent
Runs selected generators on specified key values in an HStore column in a PostgreSQL database.
--
--
--
HTML Mask
Masks text columns.
Parses the contents as HTML, and applies sub-generators to the specified path expressions.
--
--
--
Generates unique integer values.
By default, the generated values are within the range of the column’s data type.
You can also specify a range for the generated values. The source values must be within that range. Data-free if not consistent.
Yes - Self
No
Yes if not consistent
Generates a random IP address-formatted string. Data-free if not consistent.
Yes - Self or other
No
Yes if not consistent
JSON Mask
Runs a generator on values that match a user specified JSONPath.
--
--
--
Generates a random MAC address formatted string. Data-free if not consistent.
Yes - Self
No
Yes if not consistent
Name
Generates a random name string from a dictionary of first and last names. Data-free if not consistent.
Yes - Self
No
Yes if not consistent
Masks values in numeric columns.
Adds or multiplies the original value by random noise.
Yes - Self or other
No
No
Null
Generates NULL values to fill the rows of the specified column. Data-free.
No
No
Yes
Generates unique numeric strings of the same length as the input.
Yes - Self
No
No
For MongoDB only. Generates unique objectId values.
Yes - Self
No
No
Default generator. Does not perform any action on the source data.
No
No
No
Phone
Generates a random phone number that matches the country or region and format of the input phone number.
Yes - Self
No
No
Generates a random boolean value. Data-free.
No
No
Yes
Generates a random double number between the specified min and max. Data-free.
No
No
Yes
Generates a random hash string. Data-free
No
No
Yes
Returns a random integer between the specified min and max. Data-free.
No
No
Yes
Generates random dates, times, and timestamps. Data-free.
No
No
Yes
Generates a random new UUID string. Data-free.
No
No
Yes
Uses regular expressions to parse strings.
Replaces specified substrings with output from selected sub-generators.
--
--
--
Generates a column of unique integer values that start with specified value and increment by 1.
No
Yes
No
Generates values of ISO 6346 compliant shipping container codes. Data-free if not consistent.
Yes - Self or other
No
Yes if not consistent
SIN
Generates a new valid Canadian Social Insurance Number. Data-free if not consistent.
Yes - Self
No
Yes if not consistent
SSN
Generates a new valid United States Social Security Number. Data-free if not consistent.
Yes - Self or other
No
Yes if not consistent
Can apply other generators on specific StructFields within a StructType in Spark databases (Databricks and Amazon EMR).
--
--
--
Shifts timestamps by a random amount of a specific unit of time, within a set range.
Yes - Self or other
No
No
Generates unique email addresses.
Replaces the username with a randomly generated GUID, and masks the domain with a character scramble.
Yes - Self
No
No
URL
A substitution cipher that preserves formatting but keeps the URL scheme and top-level domain intact.
No
No
No
UUID Key
Generates UUIDs on primary key columns.
Yes - Self
No
No
XML Mask
Runs a selected generator on values that match a user-specified XPath.
--
--
--