Text Options

There are two kinds of options for text attributes:

Note: Options for the Text (Test) step of an activity should be exactly the same as the options for the Text step of the activity.

When you are done, click OK to close the dialog and save any changes.

Context Index

Specify the context index settings for the text column. The default selections will work well in many cases. For detailed information about these choices, see the Oracle Text documentation.

For each setting, you specify a Schema and a Name. The default schema in all cases is CTXSYS, the Oracle Text schema for context indexes.

Note: We recommend that you use the default values in all instances, except for Stoplist.

If the datatype of the text column is CLOB or BLOB, the Use multilexer selection is enabled; the default is to not use the multilexer.

If you are creating an index for a column that you have indexed before, click the Overwrite existing context index checkbox. The default is to not check the box.

Feature Extraction

A text feature is a combination of words that captures important attributes of a document or class of documents. Text features are usually keywords, frequencies of words, or other document-derived features. A document usually contains a large number of words and a much smaller number of features. Text features must be extracted from a text column.

These are the settings for feature extraction for the text column. The default selections will work well in many cases.

You can specify the following values: