This example illustrates mining data where the build data is in two tables that must be joined before building the model.
For an overview of text mining, see Text Mining.
This example uses two tables for input:
MINING_DATA_BUILD_V
CUSTOMER_COMMENTS, which contains customer comments arranged according to CUST_ID
Note: The table CUSTOMER_COMMENTS is not available, so you cannot create an activity that can be run.
This example adds complex additional data to a table. For more information about adding complex additional data, see "Complex Additional Data" in Chapter 3 - Overview of Mining Activity Guides in the Oracle Data Mining Tutorial; the tutorial includes screen shots that illustrate the process.
The table CUSTOMER_COMMENTS has the following structure:
CUST_ID, the primary key, type NUMBER, size 22
COMMENTS, type VARCHAR2, size 4000
The COMMENTS column holds customer comments. There will be several different comments for each customer.
We want to predict customers for whom the value of AFFINITY_CARD is 1.
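The relationship between the two input tables can be pictured with a minimal Python sketch. This is only an illustration of a one-to-many join on CUST_ID, not what Oracle Data Miner runs internally; the sample rows and the helper name join_one_to_many are hypothetical.

```python
from collections import defaultdict

# Hypothetical sample rows; the real tables are not reproduced here.
# Case table rows: (CUST_ID, AFFINITY_CARD)
case_rows = [(101, 1), (102, 0), (103, 0)]

# CUSTOMER_COMMENTS rows: (CUST_ID, COMMENTS); a customer may appear several times.
comment_rows = [
    (101, "Great service, will shop again."),
    (101, "Please add more checkout lanes."),
    (102, "Delivery was late."),
]


def join_one_to_many(cases, comments):
    """Attach every comment for a customer to that customer's single case row."""
    by_customer = defaultdict(list)
    for cust_id, text in comments:
        by_customer[cust_id].append(text)
    # One output record per case; customers without comments get an empty list.
    return [
        {
            "CUST_ID": cust_id,
            "AFFINITY_CARD": target,
            "COMMENTS": by_customer.get(cust_id, []),
        }
        for cust_id, target in cases
    ]


joined = join_one_to_many(case_rows, comment_rows)
for record in joined:
    print(record["CUST_ID"], record["AFFINITY_CARD"], len(record["COMMENTS"]))
```

Each case keeps exactly one record, and the comments travel with it as a nested list, which is why the number of build cases does not grow when a customer has several comments.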
These are the steps to build a classification model:
1. The case table is MINING_DATA_BUILD_V. This is a view used by the Oracle Data Mining sample programs.
2. Select Classification as the Function Type (this is the default), and select Support Vector Machine as the Algorithm.
3. Select MINING_DATA_BUILD_V. Check Join additional data with case table. Select CUST_ID as the Unique Identifier. For all other choices, use the defaults. Click Next.
4. Select CUSTOMER_COMMENTS in the Available Tables list and move it to the Selected Tables list. Click Edit to define the relationship.
5. Select CUST_ID for both Case Table Column and Related Table Column.
6. Select One to Many for Relationship Type, because there are several comments for each customer. In Transactional Column Mappings, click New to define a new mapping.
7. Select COMMENTS as the Value Column. Change the Mining Type to text. Verify that Data is Sparse is checked. Accept all other defaults.
   Note: You must change the mining type from categorical to text for text mining to take place. If the dropdown menu does not appear, you have selected an algorithm that does not support text mining.
8. Select AFFINITY_CARD as the Target. Note that the Mining Type of COMMENTS is text.
   Click OK.
9. Name the activity DEMO_TEXT_COMPLEX_JOIN.
10. The activity DEMO_TEXT_COMPLEX_JOIN is displayed. Note that the activity has all of the steps of an activity that builds a Support Vector Machine model plus the steps Text and Test(Text). In these additional steps, Oracle Data Miner does all of the processing required to prepare the text column for mining. The Options for these steps support advanced text mining features such as customized stoplists.

Copyright © 2008, Oracle. All rights reserved.
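To give a rough idea of what preparing a text column for mining can involve, the following Python sketch turns one customer's comments into sparse term counts and drops words on a stoplist. It is not Oracle Data Miner's actual processing: the tokenizer, the STOPLIST contents, the function name sparse_terms, and the sample comments are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical stoplist; the stoplists that Oracle Data Miner uses are not shown here.
STOPLIST = {"the", "a", "was", "and", "to", "is"}


def sparse_terms(comments, stoplist=STOPLIST):
    """Return term counts for one customer, storing only terms that occur."""
    counts = Counter()
    for text in comments:
        # Simplistic tokenizer (an assumption): lowercase runs of ASCII letters.
        for token in re.findall(r"[a-z]+", text.lower()):
            if token not in stoplist:
                counts[token] += 1
    return dict(counts)


features = sparse_terms(
    ["The delivery was late.", "Late again, and the box was damaged."]
)
print(features)
```

Only terms that actually appear are stored, which is the sense in which such data is sparse: most terms in the overall vocabulary are absent from any single customer's comments.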