Please use this glossary to familiarise yourself with some of the more technical terms we use on the site
- Algorithm
A series of business rules written in programming code to run a computational process - such as allocating categories or probabilities to a customer file.
- Categorical Variable
A variable containing absolute values. Examples include Gender, State & Marital Status.
- CHAID
A decision tree approach to segmentation.
- Clustering
Clustering is a statistical technique used to classify objects in to different groups based on some similarity or similarities
- Continuous Variable
A variable containing values on a continuous scale. Examples include Age, Income & Height.
- Control Group
A group of individuals or entities that are isolated from a testing environment and can therefore be used as a benchmark.
- Correlation
Measure of the “strength” of a relationship between two variables i.e. the level to which variables are related.
- Dependent variable
The event or value that one attempts to predict when using predictive modelling.
- Dummy Variable
The creation of a new variable to represent the value from another variable. For example, dummy variables to represent Male & Female could be created from a Gender variable.
- ETL
The Extraction, Transformation and Loading of data.
- Gains Chart
A graphical representation demonstrating the performance of a predictive model.
- Geodemographics
The classification of a population into groups based on geographic and demographic characteristics.
- Independent Variable
The information that one uses to help predict an event or value.
- LTV
Lifetime Value - The calculation of a customer's value to an organisation over their 'lifetime'.
- NPV
Net Present Value - The process of calculating the value of money at a point in time.
- Predictive modelling
Statical process to predict the future behaviour of a customer
- Rank
To place records in order based on a value or set of values.
- RFM
A segmentation approach driven by Recency, Frequency & Monetary Value characteristics.
- Scorecard
A predictive model outputs a scorecard which asigns a probability score to a customer record indicating their likelihood to behave in certain way e.g. likelihood to churn
- Segmentation
The separation of individuals or entities into groups that share similar characteristics or needs.
- Statistically Significant
The process of calculating the likelihood that a result can be attributed to something other than chance.
- Time Series
The analysis of data over a period of time.