Tuesday, November 6, 2012

Big Data and Real-Time Scoring with ADAPA and the Universal PMML Plug-in

PMML, the Predictive Model Markup Language, allows for predictive models to be easily moved into production and operationally deployed on-site, in the cloud, in-database or Hadoop. Zementis offers a range of products that make possible the deployment of predictive solutions and data mining models built in IBM SPSS, SAS, StatSoft STATISTICA, KNIME, SAP KXEN, R, etc. Our products include the ADAPA Scoring Engine and the Universal PMML Plug-in (UPPI). 


ADAPA, the Babylonian god of wisdom, is the first PMML-based, real-time predictive decisioning engine available on the market, and the first scoring engine accessible on the Amazon Cloud and IBM SmartCloud as a service. ADAPA on the Cloud combines the benefits of Software as a Service (SaaS) with the scalability of cloud computing. ADAPA is also available as a traditional software license for deployment on site.

As even the god of wisdom knows, not all analytic tasks are born the same. If one is confronted with massive volumes of data that need to be scored on a regular basis, in-database scoring sounds like the logical thing to do. In all likelihood, the data in these cases is already stored in a database and, with in-database scoring, there is no data movement. Data and models reside together; hence, scores and predictions flow at an accelerated pace. ADAPA’s sister product, the Universal PMML Plug-in (UPPI), is the Zementis solution for Hadoop and in-database scoring. UPPI is available for the IBM Netezza appliance, SAP Sybase IQ, and EMC Greenplum/Pivotal, Teradata and Teradata Aster. It is also available for Hadoop/Datameer. 


ADAPA and UPPI consume model files that conform to the PMML standard, version 2.0 through 4.2. If your model development environment exports an older version of PMML, our products will automatically convert your file into a 4.2 compliant format. 

Our products support an extensive collection of statistical and data mining algorithms. These include:
  • Neural Networks (Back-Propagation, Radial-Basis Function, and Neural-Gas) 
  • Regression Models (Linear, Polynomial, and Logistic)
  • General Regression Models (General Linear, Ordinal Multinomial, Generalized Linear, Cox) 
  • Support Vector Machines (for regression and multi-class and binary classification) 
  • Decision Trees (for classification and regression)
  • Scorecards (including support for reason codes and complex attributes) 
  • Association Rules 
  • Ruleset Models (flat Decision Trees)
  • Clustering Models (Distribution-Based, Center-Based, and 2-Step Clustering) 
  • Naive Bayes Classifiers 
  • Multiple Models (model composition, chaining, segmentation, and ensemble - including Random Forest Models and Stochastic Boosting)
A myriad of functions for implementing data pre- and post-processing are also supported, including:
  • Text Mining (introduced in PMML 4.2)
  • Regular Expressions
  • Value Mapping
  • Discretization
  • Normalization
  • Scaling
  • Logical and Arithmetic Operators
  • Conditional Logic
  • Built-in Functions
  • Lookup Tables
  • Business Decisions and Thresholds
  • Custom Functions ... and much much more

Visit us on the web: www.zementis.com
Follow us on twitter: @Zementis
Or send us an e-mail at info@zementis.com

No comments:

Post a Comment

Welcome to the World of Predictive Analytics!

© Predictive Analytics by Zementis, Inc. - All Rights Reserved.

Copyright © 2009 Zementis Incorporated. All rights reserved.

Privacy - Terms Of Use - Contact Us