ADVIZOR Help
v7.2+

Predictive Analytics: Analyst/X

Visualization empowers the analyst to discover patterns and anomalies in data, by noticing unexpected relationships or by actively searching. Predictive analytics (sometimes called “data mining”) provides a powerful adjunct to this: algorithms are used to find relationships in data, and these relationships can be used with new data to predict values.

Tasks you can do with the predictive analytics in ADVIZOR/X are:

  • Build a model of your data that describes which fields in a table influence the value of a target field.

  • Evaluate the quality of models you build using quality metrics.

  • Examine the model to understand the relationships between the target field and the explanatory fields.

  • Use the model with new data to predict values.

You can also model your ADVIZOR selection state to get a concise description of that set of selected items.

Start with a Question

You begin the process of modeling with a business question. The question must be about the relationships between data fields in a single table. The business question must be in terms of the values of a single field in the data being analyzed, the target field.

The target field must be either a numeric field or an integer field with exactly two values, "0" and "1". For example, if you have customer sales data and you want to understand the characteristics of highly profitable customers, then your data table must contain a field with customer profitability; this will be the target in your model.

The target field can be an existing field in a table, or you can create a model on the current selection state from interacting with charts in ADVIZOR Analyst. A new field (with values "0" and "1") can be created from the current selection state, which can then be used as the model target.
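As a rough illustration of what a selection-based target field contains, the sketch below builds a 0/1 flag for each row from a set of selected row identifiers. The function name and the idea of identifying rows by ID are assumptions made for this example; ADVIZOR creates the field for you and its internal mechanism is not described here.

```python
def selection_to_target(row_ids, selected_ids):
    """Return one flag per row: 1 if the row is selected, otherwise 0.

    Hypothetical helper for illustration only; not part of ADVIZOR.
    """
    selected = set(selected_ids)  # set membership keeps lookups fast
    return [1 if row_id in selected else 0 for row_id in row_ids]


row_ids = [101, 102, 103, 104, 105]
selected_ids = [102, 105]  # rows currently selected in the charts
target = selection_to_target(row_ids, selected_ids)
print(target)  # [0, 1, 0, 0, 1]
```

The resulting list has exactly two values, which is the form a classification target requires.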

The model you create gives the relationship of the single target field with all of the other fields in the same table, the "explanatory fields". So the initial process is:

  1. Start with a question.

  2. Pick the table in your project that contains data relevant to the question.

  3. Pick a single field that answers the question; this is the target field. Other table fields are "explanatory fields" that determine the value of the target field.

  4. Build a model using ADVIZOR Analyst that describes the relationship between the target field and explanatory fields.

There are two types of models that may be built:

  1. Predict a numeric value, or

  2. Classify data into two classes, where each case in your data is "in" or "out" (has a target field with values of "0" or "1").

Models

Predictive analytics creates a mathematical model. This model describes the relationship between the target field and the explanatory fields in a single data table. Because the model describes this relationship, there must be a value for the target field in every row of the data table. You must also have a sufficiently large volume of data to build a valid model that is both relevant and robust. For example, a model generated from a data set of 50 rows may not do well when applied to different data, and it may not do a good job of predicting the target values.

The models created by Analyst/X are regression models: mathematical polynomial functions that relate the descriptive attributes (the model inputs, that is, the explanatory fields) to a target attribute (the model output). The returned models are expressed as a first degree polynomial expression of the inputs. A polynomial of degree 1 is of the form:

f(X1, X2, ..., Xn) = w0 + w1.X1 + w2.X2 + ... + wn.Xn

where the “w”s are weights and the “X”s are fields. Although higher degree polynomials could also be used to define this relationship, in the large majority of cases a first degree polynomial is sufficient for generation of a relevant and robust model. ADVIZOR/X currently only supports first degree polynomials.
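To make the formula concrete, here is a minimal sketch that evaluates a first degree polynomial for one row of data. The weights and field values are made up for the example, and `predict_linear` is a hypothetical name; in practice the weights come from the model that Analyst/X builds.

```python
def predict_linear(w0, weights, fields):
    """Evaluate f(X1..Xn) = w0 + w1*X1 + ... + wn*Xn for one row."""
    if len(weights) != len(fields):
        raise ValueError("need exactly one weight per field")
    return w0 + sum(w * x for w, x in zip(weights, fields))


# Illustrative values only (not produced by ADVIZOR).
w0 = 10.0                    # intercept
weights = [2.0, -0.5, 0.25]  # w1, w2, w3
row = [3.0, 4.0, 8.0]        # X1, X2, X3 for one row

prediction = predict_linear(w0, weights, row)
print(prediction)  # 10 + 6 - 2 + 2 = 16.0
```

Each weight states how much the prediction changes when its field increases by one unit while the other fields stay fixed.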

For classification ("0" or "1") models, a slightly different approach called "logistic regression" is used. The result is still a polynomial equation, but the prediction is a "score": the predicted probability of the case/row falling into the "1" category. This score is used to predict the result; it may also be used to group cases into categories based on how likely they are to fall into the "1" category.
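In standard logistic regression, the first degree polynomial is passed through the logistic function to produce a probability between 0 and 1. The sketch below shows that calculation and one way to turn a score into a class. The weights are invented, and the 0.5 cutoff is an assumption for the example; this page does not state which threshold ADVIZOR uses.

```python
import math


def logistic_score(w0, weights, fields):
    """Return the predicted probability that a row belongs to class "1"."""
    z = w0 + sum(w * x for w, x in zip(weights, fields))
    # Branch on the sign of z so math.exp never receives a large positive value.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    exp_z = math.exp(z)
    return exp_z / (1.0 + exp_z)


def classify(score, threshold=0.5):
    """Assign class 1 when the score reaches the (assumed) threshold."""
    return 1 if score >= threshold else 0


score = logistic_score(-1.0, [0.8, 0.3], [2.0, 1.0])  # z = 0.9
print(round(score, 3), classify(score))
```

Because the score is a probability, rows can also be ranked or grouped by it instead of being cut at a single threshold.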

Condition Data

A model is the relationship between many explanatory fields and one target field in one table. There are constraints on which fields are usable as explanatory fields. There may also be data in other tables that you want to include in your model. Data may need to be conditioned before a model is built; this is described as part of the Analytics Process description.

Evaluate the Quality Indicator

Every model must be evaluated for adequacy before it is used. The quality indicator (also called the information indicator) is a number between 0.0 and 1.0 that gives the quality of the model. Models may always be compared with each other based on this indicator.

For ordinary regression models, the information indicator is the "coefficient of determination" (often called R2 or "R squared"). This corresponds to the proportion of the variability in the target field that the explanatory fields are able to explain. For example, a model with an information indicator of “0.79” explains 79% of the variability in the target field using the explanatory fields defined.
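The coefficient of determination can be computed from the actual and predicted target values as 1 minus the ratio of the residual sum of squares to the total sum of squares. The following sketch uses that textbook definition with made-up numbers; `r_squared` is a hypothetical helper, and ADVIZOR reports the indicator for you.

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_residual / SS_total."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    mean = sum(actual) / len(actual)
    ss_total = sum((y - mean) ** 2 for y in actual)
    ss_residual = sum((y - p) ** 2 for y, p in zip(actual, predicted))
    if ss_total == 0:
        raise ValueError("target has no variability, so R squared is undefined")
    return 1.0 - ss_residual / ss_total


actual = [1.0, 2.0, 3.0, 4.0]
print(r_squared(actual, [1.0, 2.0, 3.0, 4.0]))  # perfect predictions: 1.0
print(r_squared(actual, [2.5, 2.5, 2.5, 2.5]))  # always predicting the mean: 0.0
```

The two calls show the ends of the scale: predictions that match every row give 1.0, and predictions no better than the target's average give 0.0.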

For classification models, the R2 statistic is not appropriate. A "Percent Concordant" metric is used instead. This does NOT give the amount of variability in the target explained by the model, but it may be used to compare the quality of different models for the same target.
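A common definition of percent concordant looks at every pair made of one "1" row and one "0" row and counts the pairs in which the "1" row received the higher score. The sketch below follows that common definition with invented data. The exact formula and scaling that ADVIZOR uses are not given on this page, so treat this as illustrative only; under this definition, random scoring lands near 0.5 rather than 0.

```python
def percent_concordant(targets, scores):
    """Fraction of (class 1, class 0) pairs where the class 1 row scores higher.

    Tied pairs are not counted as concordant in this sketch.
    """
    ones = [s for t, s in zip(targets, scores) if t == 1]
    zeros = [s for t, s in zip(targets, scores) if t == 0]
    pairs = len(ones) * len(zeros)
    if pairs == 0:
        raise ValueError("need at least one row in each class")
    concordant = sum(1 for s1 in ones for s0 in zeros if s1 > s0)
    return concordant / pairs


targets = [1, 0, 1, 0]
scores = [0.9, 0.2, 0.4, 0.6]
print(percent_concordant(targets, scores))  # 3 of 4 pairs are concordant: 0.75
```

Because the measure depends only on how the rows are ranked, it is useful for comparing two models of the same target, not for stating how much variability a model explains.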

A perfect model would have an indicator of “1”; a random model has an indicator of “0”. A model with an indicator greater than or equal to 0.95 has excellent predictive power, but any score above 0 indicates some predictive power, better than random results. To improve the information indicator of a model, add new fields to the data table.

Interpret the Model

After a model has been determined to be adequate based on its quality indicators, it can be used to understand the relationships within the data. The major relationship described by the model is the contribution of each variable to predicting the target: how much of the variability of the target is explained by each explanatory field. This information is shown in two pages that are added to your project.

Apply the Model to New Data

Modeling produces an equation that may be saved with the project as an Expression Builder expression. This will be run whenever the project is loaded, so it can be applied whenever your project is regenerated with new data.

What's Next?

Read the detailed process for using Analyst/X predictive analytics.

Understand the Predictive Modeling pane, the user interface to predictive analytics.

Use a model to predict values in new data.

Read techniques for modeling Zip Codes.

See Also

  • Predictive Modeling Pane

  • Managing Models

Last updated 5 years ago