Generators

Address

Generates a random address-like string. This can be applied to various parts of an address string. For example, just the street address, just the city, or a full address string. This generator can be made consistent and can be linked with another address generator to preserves the relation between the data. This way the masked data will end up producing a valid city/state/etc. pair.

Algebraic

The algebraic generator identifies the algebraic relationship between 3 or more numeric values (at least one non-integer) and generates new values to match. If a relationship cannot be found it defaults to the categorical generator. This generator can be linked with another algebraic generator.

Alphanumeric String Key

Generates unique alpha numeric strings of the same length as the input. This generator can be made consistent.

Array JSON Mask

Run a generator on values that match a user specified JSONPath.

Array Character Scramble

This generator replaces letters with random other letters, and numbers with random other numbers; punctuation and whitespace are preserved. This generator securely masks letters and numbers, there's no way to recover the original data. This generator can be made consistent.

ASCII Key

Generates unique alpha-numeric strings based on any printable ASCII characters. The length of the string will not be preserved. This generator can be made consistent.

Categorical

A categorical generator creates values at the same frequency and values of the underlying data. This generator can be linked with another Categorical generator and can be made differentially private.

Character Scramble

This generator replaces letters with random other letters, and numbers with random other numbers; punctuation and whitespace are preserved. This generator securely masks letters and numbers, there's no way to recover the original data. This generator can be made consistent.

Character Substitution

Random character replacement that preserves formatting (spaces, capitalization, and punctuation). It's best if you need to preserve a join between two text columns, such as a join on a name or email. This generator can be linked with other character substitution generators.

Company Name

Generates a random company name like string. This generator can be made consistent.

Conditional

Apply different generators to rows conditionally based on any value in this table.

Constant

Generates a single value (based on user input) that is used to mask all values in the column.

Continuous

Generates a continuous distribution using a normal distribution to fit the underlying data. This generator can be linked to other Continuous Generators to create multi-variate distributions and can be partitioned by other columns.

Cross Table Sum

The cross table sum generator links columns in two tables. This column will be the sum of the values in another column. There is no preview for this generator as the sums cannot be computed until the other table is generated.

Custom Categorical

A categorical generator that selects from values you provide. This generator can be made consistent.

Date Truncation

Truncates dates to the specific date part.

Email

This generator scrambles characters while preserving formatting and keeping the '@', '.' This generator securely masks letters and numbers, there's no way to recover the original data. This generator can be made consistent.

Events

Generates timestamps fitting an event distribution. Link columns to create a sequence of events across multiple columns. This generator can be partitioned by other columns.

File Name

This generator scrambles characters while preserving formatting and keeping the file extension intact. This generator securely masks letters and numbers, there's no way to recover the original data. This generator can be made consistent.

Find and Replace

This generator replaces all instances of the find string with the replace string. If "Use regex" is enabled, use backslash ( \ ) as the escape character.

HIPAA Address

This generator can be used to generate cities, states, and zip codes that follow HIPAA guidelines for safe harbor.

Zip Codes

When generating zip codes, we examine the underlying zip code in the column and replace the last three digits with 0 unless the zip code is a low population area as designated by the current census, in which case we replace all digits in the zip code with 0.

Cities

When a zip code column has not been linked then we simply choose a random city in the United States. When a zip code has already been added to the link, however, we make sure to choose a city, at random, which has at least some overlap with the zip code.

For example, if the original city and zip code were (Atlanta, 30305) we would replace the zip code with 30300. There are many cities that contain zip codes beginning in 303 such as Atlanta, Decatur, Chamblee, Hapeville, Dunwoody, College Park, etc.). One of these cities is chosen at random so that our final value is (Chamblee, 30300), for example.

If the original zip code is designated as a low population area then we just choose a random city within the state, however, we only do this if the user has linked a State column. If they have not, we simply choose a random city anywhere in the United States.

States

HIPAA guidelines allow for information at the state level to be kept. Therefore, we passthrough these values.

Other address parts

All other address parts are generated randomly and hence there value is not influenced at all by the underlying value in the column.

Hostname

Generates random host names, based on the English language. This generator can be made consistent.

Integer Key

Generates integer values between 0 and 2^32 - 1. Input Values must be in the range 0 to 2^31 - 1 as well. This generator can be made consistent.

IP Address

Generates a random IP address formatted string. This generator can be made consistent.

JSON Mask

Run a generator on values that match a user specified JSONPath.

MAC Address

Generates a random MAC address formatted string. This generator can be made consistent.

Name

Generates a random name string that can be supplied with a number of tags to identify the type of name (First; Last; First,Last; Last,First, etc.). This generator can be made consistent.

Null

Generates NULL values to fill the rows of the specified column.

Numeric String Key

Generates unique numeric strings of the same length as the input. This generator can be made consistent.

Passthrough

Default option, does not mask data.

Phone

Generates a random phone number that matches the country/region of the input phone number. This generator can be made consistent. These numbers will pass Google's libphonenumber verification

Random Boolean

Generates a random boolean value. This generator can be linked with other random boolean generators.

Random Double

Generates a random double number between the specified min and max. For this generator, the "min" is inclusive and the "max" is exclusive.

Random Hash

Generates a random hash string.

Random Integer

Returns a random integer between the specified min and max. For this generator, the "min" in inclusive and the "max" is exclusive

Random Timestamp

Generates random dates, times, and timestamps.

Random UUID

Generates a random new UUID string.

Sequential Integer

Generate a column of unique integer values. Starting value is 0 and increments in increments of 1. This generator can be linked with other sequential integer generators.

Shipping Container

Generates values of ISO 6346 compliant shipping container codes. All generated codes are in the freight category ("U"). This generator can be made consistent.

SIN

Generates a new valid Canadian Social Insurance Number that preserves formatting (non-digit characters). This generator can be made consistent.

SSN

Generates a new valid United States Social Security Number. This generator can be made consistent.

Unique Email

Generates unique e-mail addresses by replacing the username with a randomly generated GUID and masking the domain with a character scramble. This Generator only guarantees uniqueness if the underlying column is unique. This generator can be made consistent.

URL

This is a substitution cipher that preserves formatting but keeps the URL scheme and top-level domain intact. This mask is not secure.

UUID Key

Generates UUIDs on Primary Key columns. All FK columns referencing this column will automatically have their UUID values masked as well.

XML Mask

Run a generator on values that match a user specified XPath

‚Äč