Generator summary
The following table summarizes the available generators. It indicates whether each generator can be made consistent, can be linked, and is differentially private.
In the Consistency column, the table also indicates whether the generator can be made self-consistent only, or can be made either self-consistent or consistent with another column.
For generators that can be data-free, the Description column includes whether the generator is always data-free, or only data-free when consistency is disabled.
Generator | Description | Consistency | Linking | Differential Privacy |
---|---|---|---|---|
Generates a random string to replace a specific part of a mailing address.
Data-free if not consistent. | Yes - Self or other | Yes | Yes if not consistent | |
Uses deep neural networks for high-fidelity data mimicking.
By default, not available. | No | No | No | |
Identifies the algebraic relationship between 3 or more numeric values (at least one non-integer) and generates new values to match. | No | Yes | No | |
Generates unique alphanumeric strings of the same length as the input. | Yes - Self | No | No | |
Within an array, replaces letters with random other letters, and numbers with random other numbers. | Yes - Self | No | No | |
Runs a selected generator on values that match a user-specified JSONPath. | -- | -- | -- | |
Runs a selected generator on values that match a regular expression. | -- | -- | -- | |
Generates unique alpha-numeric strings based on any printable ASCII characters. | Yes - Self | No | No | |
Creates values at the same frequency as the values in the underlying data. | No | Yes | Configurable | |
Replaces letters with random other letters and numbers with random other numbers. | Yes - Self | No | No | |
Replaces characters randomly, but preserves formatting. | Yes - Implicitly consistent | No | No | |
Generates a random company name-like string.
Data-free if not consistent. | Yes - Self or other | No | Yes if not consistent | |
Applies different generators to rows conditionally based on any value in the table. | No | No | No | |
Uses a single specified value to mask all values in the column.
Data-free. | No | No | Yes | |
Generates a continuous distribution to fit the underlying data. | No | Yes | Configurable | |
Populates the column using the sum of the values in other columns. | No | No | No | |
Masks a text column. Parses the text as a row for which the columns are delimited by a specified character. | -- | -- | -- | |
Selects from values you provide.
Data-free if not consistent. | Yes - Self | No | Yes if not consistent | |
Truncates dates or timestamps to a specific date or time part. | No | No | No | |
Scrambles characters in an email address. Preserves the formatting and keeps the @ and . . | Yes - Self | No | No | |
Generates timestamps that fit an event distribution. | No | Yes | No | |
Scrambles characters in a file name. Preserves the formatting and the file extension. | Yes - Self | No | No | |
Replaces all instances of the find string with the replace string. | No | No | No | |
Masks columns that contain latitude and longitude values. | No | No | No | |
Can be used to generate cities, states, zip codes, and latitude/longitude values that follow HIPAA guidelines for safe harbor. | Yes - Self | No | No | |
Generates random host names, based on the English language.
Data-free if not consistent. | Yes - Self or other | No | Yes if not consistent | |
Runs selected generators on specified key values in an HStore column in a PostgreSQL database. | -- | -- | -- | |
Masks text columns. Parses the contents as HTML, and applies sub-generators to the specified path expressions. | -- | -- | -- | |
Generates unique integer values. By default, the generated values are within the range of the column’s data type. You can also specify a range for the generated values. The source values must be within that range.
Data-free if not consistent. | Yes - Self | No | Yes if not consistent | |
Generates a random IP address-formatted string.
Data-free if not consistent. | Yes - Self or other | No | Yes if not consistent | |
Runs a generator on values that match a user specified JSONPath. | -- | -- | -- | |
Generates a random MAC address formatted string.
Data-free if not consistent. | Yes - Self | No | Yes if not consistent | |
Generates a random name string from a dictionary of first and last names.
Data-free if not consistent. | Yes - Self | No | Yes if not consistent | |
Masks values in numeric columns. Adds or multiplies the original value by random noise. | Yes - Self or other | No | No | |
Generates NULL values to fill the rows of the specified column.
Data-free. | No | No | Yes | |
Generates unique numeric strings of the same length as the input. | Yes - Self | No | No | |
For MongoDB only.
Generates unique objectId values. | Yes - Self | No | No | |
Default generator.
Does not perform any action on the source data. | No | No | No | |
Generates a random phone number that matches the country or region and format of the input phone number. | Yes - Self | No | No | |
Generates a random boolean value.
Data-free. | No | No | Yes | |
Generates a random double number between the specified min and max.
Data-free. | No | No | Yes | |
Generates a random hash string.
Data-free | No | No | Yes | |
Returns a random integer between the specified min and max.
Data-free. | No | No | Yes | |
Generates random dates, times, and timestamps.
Data-free. | No | No | Yes | |
Generates a random new UUID string.
Data-free. | No | No | Yes | |
Uses regular expressions to parse strings. Replaces specified substrings with output from selected sub-generators. | -- | -- | -- | |
Generates a column of unique integer values that start with specified value and increment by 1. | No | Yes | No | |
Generates values of ISO 6346 compliant shipping container codes.
Data-free if not consistent. | Yes - Self or other | No | Yes if not consistent | |
Generates a new valid Canadian Social Insurance Number.
Data-free if not consistent. | Yes - Self | No | Yes if not consistent | |
Generates a new valid United States Social Security Number.
Data-free if not consistent. | Yes - Self or other | No | Yes if not consistent | |
Can apply other generators on specific StructFields within a StructType in Spark databases (Databricks and Amazon EMR). | -- | -- | -- | |
Shifts timestamps by a random amount of a specific unit of time, within a set range. | Yes - Self or other | No | No | |
Generates unique email addresses. Replaces the username with a randomly generated GUID, and masks the domain with a character scramble. | Yes - Self | No | No | |
A substitution cipher that preserves formatting but keeps the URL scheme and top-level domain intact. | No | No | No | |
Generates UUIDs on primary key columns. | Yes - Self | No | No | |
Runs a selected generator on values that match a user-specified XPath. | -- | -- | -- |
Last modified 13d ago