This example illustrates mining data where the build data is in two tables that must be joined before building the model.
For an overview of text mining, see Text Mining.
This example uses two tables for input:
MINING_DATA_BUILD_V
SH.SUPPLEMENTARY_DEMOGRAPHICS
, which contains customer comments arranged according to customer IDYou install the SH schema and set appropriate access to it when you install the Data Mining Sample programs, as described in the Oracle Data Mining Administrators Guide.
This example adds additional data to a table. For more information about adding additional data, see "Simple Additional Data" in Chapter 3 - Overview of Mining Activity Guides in the Oracle Data Mining Tutorial; the tutorial includes screen shots that illustrate the process.
We want to predict customers for whom the value of AFFINITY_CARD
is 1.
Follow these steps to build a classification model:
Classification
as the Function Type (this is the default), and select Support Vector Machine
as the Algorithm.MINING_DATA_BUILD_V
. Check Join additional data with case table. Select CUST_ID
as the Unique Identifier. For all other choices, use the defaults. Click Next.
Note: Oracle Data Mining requires that the data for text mining be a table, not a view. If you provide a view as input to a Mining Activity, the view is automatically converted.
SH.SUPPLEMENTARY_DEMOGRAPHICS
in the Available Tables list and move it to the Selected Tables list. Click Edit to define the relationship.CUST_ID
for both Case Table Column and Related Table Column.One to One
for Relationship Type. In Selected Table Columns deselect all attributes except for COMMENTS
; the only data that is needed from SH.SUPPLEMENTARY_DEMOGRAPHICS
is user comments. Click OK to close the Edit Relationship Window.
Then click Next to finish Step 3 of the Wizard.AFFINITY_CARD
as the Target. Change the mining type of SH.SUPPLEMENTARY_DEMOGRAPHICS.COMMENTS
to text
: Select the COMMENTS
row, click in the Mining Type column for COMMENTS
and select text
from the dropdown menu.
Note: You must change the mining type from categorical
to text
for text mining to take place. If the dropdown menu does not appear, you have selected an algorithm that does not support text mining.
DEMO_TEXT_JOIN
.DEMO_TEXT_JOIN
is displayed. Note that the activity has all of the steps of an activity that builds a Support Vector Machine model plus the steps Text and Test(Text). In these additional steps, Oracle Data Miner does all of the processing required to prepare the text column for mining. The Options for these steps support advanced text mining features such as customized stoplists.MINING_DATA_BUILD_V
, which does not contain the user comments, the Predictive Confidence is approximately 60%.Copyright © 2008, Oracle. All rights reserved.