Cleanse
  • 13 Jan 2023

Description

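The Cleanse node applies cleansing operations to selected text columns in a data set. The operations include trimming spaces, case conversion, word replacement, regular expression replacement, and encryption. It can also break a datetime column out into its component parts.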


Configuration Options

Basic Configuration Options

Setting | Description/Parameters
--- | ---
Text Columns to Cleanse | Columns from the parent node to which text cleansing is applied.
Trim Leading/Trailing Spaces | Removes all leading and trailing spaces from the text field.
Datetime Breakout | Breaks a datetime column out into six string columns representing year, month, day, hour, minute, and second.
Case Conversion | Converts the text to a single case. Options include Upper or Lower.
Replace Common Names | Replaces each entry in a list of common names with the designated text string.
Replace Stop Words | Replaces each entry in a list of stop words with the designated text string.
Encryption | Encrypts or decrypts the field. Options include Encrypt or Decrypt.
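
The trim, case conversion, and datetime breakout settings can be illustrated with a minimal Python sketch. This is not the node's implementation. The function names, the output column names, and the zero-padding of the datetime parts are all assumptions made for illustration.

```python
from datetime import datetime
from typing import Dict


def trim_spaces(value: str) -> str:
    """Remove leading and trailing space characters only."""
    # Assumption: only the space character is trimmed, not tabs or newlines.
    return value.strip(" ")


def convert_case(value: str, mode: str) -> str:
    """Convert text to upper or lower case, mirroring the Upper/Lower options."""
    if mode == "Upper":
        return value.upper()
    if mode == "Lower":
        return value.lower()
    raise ValueError(f"Unsupported case mode: {mode!r}")


def datetime_breakout(value: datetime) -> Dict[str, str]:
    """Split a datetime into six string parts (names and padding are assumed)."""
    return {
        "year": f"{value.year:04d}",
        "month": f"{value.month:02d}",
        "day": f"{value.day:02d}",
        "hour": f"{value.hour:02d}",
        "minute": f"{value.minute:02d}",
        "second": f"{value.second:02d}",
    }


print(trim_spaces("  Acme Corp  "))        # Acme Corp
print(convert_case("Acme Corp", "Upper"))  # ACME CORP
print(datetime_breakout(datetime(2023, 1, 13, 9, 5, 0)))
```

Whether the node zero-pads values such as month or hour is not stated in this article, so check the preview output before relying on a particular format.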

Advanced Configuration Options

Regular Expression Replacement

Setting | Description
--- | ---
Description | Description of the replacement.
Search for RegEx | Regular expression used for pattern matching.
Replace With | Replacement text.
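
As a rough illustration of how a set of regular expression replacements behaves, the sketch below models each entry as a rule with the three settings above and applies the rules in order using Python's `re.sub`. The `RegexRule` type, the `apply_rules` function, the sequential order of application, and the use of Python regex syntax are assumptions. The regex dialect the node supports is not stated in this article.

```python
import re
from typing import List, NamedTuple


class RegexRule(NamedTuple):
    """Hypothetical container mirroring the three settings in the table."""
    description: str   # Description
    search_for: str    # Search for RegEx
    replace_with: str  # Replace With


def apply_rules(value: str, rules: List[RegexRule]) -> str:
    """Apply each rule in list order (the ordering is an assumption)."""
    for rule in rules:
        value = re.sub(rule.search_for, rule.replace_with, value)
    return value


rules = [
    RegexRule("Remove punctuation", r"[^\w\s]", ""),
    RegexRule("Collapse repeated whitespace", r"\s+", " "),
]

print(apply_rules("Hello,   world!", rules))  # Hello world
```

Order matters in a setup like this: removing punctuation first can leave extra whitespace behind, which the second rule then collapses.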

Stop Words and Common Names

Setting | Description
--- | ---
Stop Words | a, able, about, across, after, all, almost, also, am, among, an, and, any, are, as, at, be, because, been, but, by, can, cannot, could, dear, did, do, does, either, else, ever, every, for, from, get, got, had, has, have, he, her, hers, him, his, how, however, i, if, in, into, is, it, its, just, least, let, like, likely, may, me, might, most, must, my, neither, no, nor, not, of, off, often, on, only, or, other, our, own, rather, said, say, says, she, should, since, so, some, than, that, the, their, them, then, there, these, they, this, tis, to, too, twas, us, wants, was, we, were, what, when, where, which, while, who, whom, why, will, with, would, yet, you, your

CommonNames.txt
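
The effect of replacing stop words (or common names) with a designated string can be sketched in Python as follows. The `replace_words` function is hypothetical, the word set is a short excerpt of the stop word list, and both whole-word matching and case-insensitive matching are assumptions about how the node behaves.

```python
import re
from typing import Iterable

# Short excerpt of the stop word list, for illustration only.
STOP_WORDS = {"a", "an", "and", "for", "of", "the"}


def replace_words(value: str, words: Iterable[str], replacement: str) -> str:
    """Replace whole-word, case-insensitive matches with the replacement."""
    word_list = sorted(set(words), key=len, reverse=True)
    if not word_list:
        return value  # nothing to replace
    # \b anchors keep "a" from matching inside words such as "America".
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(w) for w in word_list) + r")\b",
        re.IGNORECASE,
    )
    # A function is used so the replacement is inserted literally.
    return pattern.sub(lambda _match: replacement, value)


print(replace_words("The Bank of America", STOP_WORDS, "#"))  # # Bank # America
```

With an empty replacement string the surrounding spaces are left in place, so a trim or whitespace-collapsing step afterwards is usually worthwhile.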


Actions

Action | Description
--- | ---
Add Regex | Adds a new regular expression replacement, which lets you replace text that matches a pattern.
Preview | Once the node is configured, the result set can be previewed at any time.
