Thursday, March 8, 2012

Data Mining slicing / filtering

Is there a way to dynamically slice / filter data when running a data mining query?

(Dynamically means - not deployed as part of the structure)

For example - I’d like find associations to products sold in different geographic locations.

Does the answer differ if the structure is based on a cube or on a rational DB?

The answer is no - the counts of product associations are the counts for the products with which the model was changed. You would have to create specific structure for each subset of the data you would want to model.

You may be able to use many-many dimensions in OLAP and then provide the appropriate slice to see association counts by region, but you won't have a mining model that can perform predictions, and you will only see pairwise counts.

No comments:

Post a Comment