Oracle Data Miner Preferences

Use this selection to specify certain defaults and settings for Oracle Data Miner. There are the following tabs:

When you are done, click OK to make the changes or Cancel to make no changes.

Environment

Use the Environment tab to specify the working directory, the location of SQL*Loader, and the Look and Feel of the user interface:

Sampling

Use the Sampling tab to specify the following:

Note: Sampling size cannot be specified for transactional (multi-record case) data.

This value does not apply to the Sample and Stratified Sample wizards; these wizards use a default sample size of 10,000.

Data

Use the Data tab to specify the characteristics of data profiling.

The first set of constants specify characteristics of unique values:

The next set of constants specify Warn When limits related to numbers of bins. If your data exceeds one of these limits, you are warned. For example, if you have more than 250 bins for a categorical attribute, you are warned.

Connections

Use the Connections tab to view and manage database connections.

All defined connections are listed in the Connections pane.

To define a new connection, click New and proceed as described in Define and Edit Connections.

To edit or view a connection, select the connection name in the Connections pane, click Edit, and proceed as described in Define and Edit Connections. Passwords are encoded.

To delete a connection, select the connection name in the Connections pane, and click Delete.

Tasks

Use the Tasks tab to control how long tasks are displayed in the Active Tasks pane. The default is to display all tasks that are active (queued or executing) and all tasks that completed (with a status such as SUCCEEDED, FAILED, or STOPPED) within the last 60 minutes. You can change how long you display completed tasks by changing the number of minutes. You must specify an integer number of minutes.

Discretizing

Use the Discretizing tab to specify how the discretization wizard generates bin definitions. The default behavior is to generate bins with names that are Integers. You may select character Strings. Certain algorithms may require integers for bin names; a mining activity can override this selection, if necessary.

Clustering Model

Use the Clustering Model tab to control the precision of numerical values displayed in cluster details. Precision refers to the number of digits to the right of the decimal point; for example, 29.06 has precision 2.

You can choose User Specified Precision with a Precision is an integer >= 0 and <=39, or you can choose Maximum Precision. The default is User Specified Precision with Precision equal to 1.

If you choose Maximum Precision, numbers may be expressed in scientific notation if scientific notation is required to display a value. If some numbers are in scientific notation, you will not be able to sort centroid values properly, because of the mix of notations.