Text Feature Extraction Settings

A text feature is a combination of words that captures important attributes of a document or class of documents. Typical text features are keywords, word frequencies, or other values derived from the document. A document typically contains a large number of words and a much smaller number of features. Text features must be extracted from a text column.
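To make the idea concrete, the following is a minimal sketch of word-frequency features in Python. It is not the algorithm that Oracle Text uses; the function name extract_features, the stop-word list, and the sample sentence are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "for"}


def extract_features(text, max_features=3):
    """Return the max_features most frequent non-stop words with their counts."""
    # Lowercase the text and split it into alphabetic tokens.
    words = re.findall(r"[a-z]+", text.lower())
    # Count every token that is not a stop word.
    counts = Counter(w for w in words if w not in STOP_WORDS)
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:max_features]


# Sample text standing in for one value of a text column.
document = (
    "The engine overheated. The engine was replaced "
    "and the new engine runs well."
)
features = extract_features(document)
print(features)
```

In this sketch the sample sentence has 13 words, but only 3 are kept as features, and "engine" ranks first because it occurs three times. This mirrors the point above that a document has many words and far fewer features.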

This page shows the feature extraction settings for the text column. The default selections work well in many cases.

You can specify the following values:

Click Next to proceed.

For more information about feature extraction for text, see the Oracle Text documentation. "Where to Find More Information" describes how to locate the Oracle Text documentation.