How to Outsource Analytics
- Multiple Model Discounts—Less for second or third models within a project?
- Pre-developed (off-the-shelf) Models—These can be "much" cheaper than custom models, while not custom-fitted.
- Acquisition vs. CRM—Employing client-specific variables certainly increases the cost.
- Regression Models vs. Other Types—At times, types of techniques may affect the price.
- Clustering and Segmentations—They are generally priced much higher than target-specific models.
Again, it really depends on the complexity factor more than anything else, and the pre- and post-modeling process must be estimated and priced separately. Non-modeling charges often add up fast, and you should ask for unit prices and minimum charges for each step.
Scoring charges in time can be expensive, too, so negotiate for discounts for routine scoring of the same models. Some may offer all-inclusive package pricing for everything. The important thing is that you must be consistent with the checklist when shopping around with multiple candidates.
7. Documentation: When you pay for a custom model (not pre-developed, off-the-shelf ones), you get to own the algorithm. Because algorithms are not tangible items, the knowledge is to be transformed in model documents. Beware of the ones who offer "black-box" solutions with comments like, "Oh, it will work, so trust us."
Good model documents must include the following, at the minimum:
- Target and Comparison Universe Definitions: What was the target variable (or "dependent" variable) and how was it defined? How was the comparison universe defined? Was there any "pre-selection" for either of the universes? These are the most important factors in any model—even more than the mechanics of the model itself.
- List of Variables: What are the "independent" variables? How were they transformed or binned? From where did they originate? Often, these model variables describe the nature of the model, and they should make intuitive sense.
- Model Algorithms: What is the actual algorithm? What are the assigned weight for each independent variable?
- Gains Chart: We need to examine potential effectiveness of the model. What are the "gains" for each model group, from top to bottom (e.g., 320 percent gain at the top model group in comparison to the whole universe)? How fast do such gains decrease as we move down the scale? How do the gains factors compare against the validation sample? A graphic representation would be nice, too.
For custom models, it is customary to have a formal model presentation, full documentation and scoring script in designated programming languages. In addition, if client files are provided, ask for a waterfall report that details input and output counts of each step. After the model scoring, it is also customary for the vendor to provide a scored universe count by model group. You will be shocked to find out that many so-called analytical vendors do not provide thorough documentation. Therefore, it is recommended to ask for sample documents upfront.
Stephen H. Yu is a world-class database marketer. He has a proven track record in comprehensive strategic planning and tactical execution, effectively bridging the gap between the marketing and technology world with a balanced view obtained from more than 30 years of experience in best practices of database marketing. Currently, Yu is president and chief consultant at Willow Data Strategy. Previously, he was the head of analytics and insights at eClerx, and VP, Data Strategy & Analytics at Infogroup. Prior to that, Yu was the founding CTO of I-Behavior Inc., which pioneered the use of SKU-level behavioral data. “As a long-time data player with plenty of battle experiences, I would like to share my thoughts and knowledge that I obtained from being a bridge person between the marketing world and the technology world. In the end, data and analytics are just tools for decision-makers; let’s think about what we should be (or shouldn’t be) doing with them first. And the tools must be wielded properly to meet the goals, so let me share some useful tricks in database design, data refinement process and analytics.” Reach him at email@example.com.