There are two kinds of options for text attributes:
Note: Options for the Text (Test) step of an activity should be exactly the same as the options for the Text step of the activity.
When you are done, click OK to close the dialog and save any changes.
Specify the context index settings for the text column. The default selections will work well in many cases. For detailed information about these choices, see the Oracle Text documentation.
For each setting, you specify a Schema and a Name. The default schema in all cases is CTXSYS
, the Oracle Text schema for context indexes.
Note: We recommend that you use the default values in all instances, except for Stoplist.
DEFAULT_DATASTORE
is the default.DEFAULT_LEXER
is the default value.DEFAULT_WORDLIST
is the default value.DEFAULT_STORAGE
, the default value, indicates that the Text index tables are created in the index owner's default tablespace.CTXSYS.DEFAULT_STOPLIST
, or you can select a stoplist that you create. See Stopwords and Stoplists for an example of creating a stoplist.NOT APPLICABLE
.If the datatype of the text column is CLOB
or BLOB
, the Use multilexer selection is enabled; the default is to not use the multilexer.
If you are creating an index for a column that you have indexed before, click the Overwrite existing context index checkbox. The default is to not check the box.
A text feature is a combination of words that captures important attributes of a document or class of documents. Text features are usually keywords, frequencies of words, or other document-derived features. A document usually contains a large number of words and a much smaller number of features. Text features must be extracted from a text column.
These are the settings for feature extraction for the text column. The default selections will work well in many cases.
You can specify the following values:
CTX_DOC.THEMES
generates lists of themes.Copyright © 2006, 2008, Oracle. All rights reserved.