Automatic Data Preparation

Automatic Data Preparation (ADP) is a new feature in Oracle Data Mining 11g.

In most cases, data must be transformed using techniques such as binning, normalization, or missing value treatment before it can be mined. Data for build, test, and apply must undergo the exact same transformations. Oracle Data Miner mining activities determine the transformations that are required and applies them.

Oracle Data Mining 11g allows automatic data preparation. Algorithm-appropriate transformation instructions are embedded in the model and automatically applied to the build data and scoring data. The automatic transformations can be complemented by or replaced with user-specified transformations.

Because they contain the instructions for their own data preparation, mining models that use ADP are known as supermodels.

Oracle Data Miner indicates which models were built using ADP. Oracle Data Miner uses ADP when it builds Generalized Linear Models.