Developed by the Data Mining Group, an independent, vendor-led committee, PMML provides an open standard for representing data mining models. Models can therefore be shared easily between different applications, avoiding proprietary issues and incompatibilities. All major commercial and open-source data mining tools now support PMML.
PMML is an XML-based language which follows a very intuitive structure to describe data pre- and post-processing as well as predictive algorithms. Not only does PMML represent a wide range of statistical techniques, but it can also be used to represent input data as well as the data transformations necessary to transform raw data into meaningful features.
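To make the structure described above more concrete, here is a minimal sketch in Python. The embedded PMML document is a hypothetical, hand-written toy (the field names `age`, `risk`, and `age_scaled` and all numbers are invented for illustration), and the element names are assumed to follow the PMML 4.1 schema. The helper functions `describe` and `score` are likewise illustrative assumptions: they handle only this one document and are in no way a conforming PMML consumer.

```python
import xml.etree.ElementTree as ET

# Hypothetical toy PMML document: a data dictionary (input data), a
# transformation dictionary (pre-processing), and a regression model.
PMML_DOC = """<PMML version="4.1" xmlns="http://www.dmg.org/PMML-4_1">
  <Header description="Toy regression model"/>
  <DataDictionary numberOfFields="2">
    <DataField name="age" optype="continuous" dataType="double"/>
    <DataField name="risk" optype="continuous" dataType="double"/>
  </DataDictionary>
  <TransformationDictionary>
    <DerivedField name="age_scaled" optype="continuous" dataType="double">
      <NormContinuous field="age">
        <LinearNorm orig="0" norm="0"/>
        <LinearNorm orig="100" norm="1"/>
      </NormContinuous>
    </DerivedField>
  </TransformationDictionary>
  <RegressionModel functionName="regression">
    <MiningSchema>
      <MiningField name="age"/>
      <MiningField name="risk" usageType="predicted"/>
    </MiningSchema>
    <RegressionTable intercept="0.5">
      <NumericPredictor name="age_scaled" coefficient="2.0"/>
    </RegressionTable>
  </RegressionModel>
</PMML>
"""

# Namespace map so that element lookups match the PMML 4.1 namespace.
NS = {"p": "http://www.dmg.org/PMML-4_1"}


def describe(pmml_text):
    """Return the PMML version, raw field names, and derived field names."""
    root = ET.fromstring(pmml_text)
    fields = [f.get("name")
              for f in root.findall("p:DataDictionary/p:DataField", NS)]
    derived = [f.get("name")
               for f in root.findall("p:TransformationDictionary/p:DerivedField", NS)]
    return root.get("version"), fields, derived


def score(pmml_text, age):
    """Apply the toy document's normalization, then its regression table."""
    root = ET.fromstring(pmml_text)
    # Pre-processing: linear interpolation between the two LinearNorm points.
    points = root.findall(".//p:NormContinuous/p:LinearNorm", NS)
    (o1, n1), (o2, n2) = [(float(p.get("orig")), float(p.get("norm")))
                          for p in points]
    age_scaled = n1 + (age - o1) * (n2 - n1) / (o2 - o1)
    # Model: intercept plus coefficient times the derived feature.
    table = root.find("p:RegressionModel/p:RegressionTable", NS)
    predictor = table.find("p:NumericPredictor", NS)
    return (float(table.get("intercept"))
            + float(predictor.get("coefficient")) * age_scaled)


print(describe(PMML_DOC))
print(score(PMML_DOC, 50.0))  # 0.5 + 2.0 * 0.5 = 1.5
```

The point of the sketch is that the raw inputs, the transformation that turns `age` into a feature, and the model itself all live in one document, so any consuming application can read the same file rather than re-implementing the logic.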
As part of the Data Mining Group, Zementis is committed to the continual development of PMML. It is our vision for the community that users will be free to share models among many solutions, benefiting from an environment in which interoperability is truly attainable. For this reason, we have made available to the data mining community the Transformations Generator. It allows users to interactively design data transformations in PMML.
PMML Presentation
Zementis presentation on PMML and Predictive Analytics to the ACM Data Mining Bay Area/SF group at the LinkedIn auditorium in Sunnyvale, CA. In this talk, Dr. Alex Guazzelli, Zementis VP of Analytics, provides the business rationale behind PMML as well as an overview of its main components. Besides being able to describe the most common modeling techniques, PMML has been capable of handling complex pre-processing tasks since version 4.0, released in 2009. Version 4.1, released in December 2011, added complex post-processing as well as the ability to represent model ensembles, segmentation, chaining, and composition within a single language element. This combined representational power, in which an entire predictive solution (from pre-processing to model(s) to post-processing) can be represented in a single PMML file, attests to the language's refinement and maturity.
PMML Community Forum
For ongoing discussion and the latest PMML news, we invite you to join the PMML group on LinkedIn or the discussion forum in the PMML group on Analytic Bridge, a social network community for analytics professionals.
PMML Book
PMML in Action is a great way to learn how to represent your predictive solutions through a mature and refined open standard. For the 2nd edition, the book has been completely revised for PMML 4.1, the latest version of PMML. It includes new chapters and an expanded description of how to represent multiple models in PMML, including model ensembles, segmentation, chaining, and composition. The book is divided into six parts, taking you on a PMML journey in which language elements and attributes are used to represent not only modeling techniques but also data pre- and post-processing.
With PMML, users benefit from a single and concise standard to represent predictive models, thus avoiding the need for custom code and proprietary solutions.
You too can join the PMML movement! Unleash the power of predictive analytics and data mining today!
Available for purchase on Amazon.com
Reviews:
"The very first book that covers the industry standard for transferring and integrating predictive models across systems, this is a milestone for predictive analytics. If you want the long and short on engineering for versatility in how predictive models can be deployed and put to work, get started by curling up with this book."
Eric Siegel, Ph.D., President, Prediction Impact, Inc., Conference Chair, Predictive Analytics World (Predictive Analytics World)
"Open standards facilitate innovation and progress (web is a great example). PMML (the Predictive Model Markup Language) is an open standard for predictive analytics and data mining, developed over more than 12 years and supported by most industry leaders. This easy to read book covers data transformations, many modeling methods (Associations, Clustering, Decision Trees, Neural Nets, Regression, SVM, and more), model ensembles, and verification. This book is your essential guide to PMML!"
Gregory Piatetsky, Ph.D., Editor KDnuggets, Founder KDD/SIGKDD (KDNuggets.com)
"Next generation enterprise are going to be driven by analytics, especially predictive analytics. Sharing and rapidly deploying predictive analytic models is essential and PMML is the open standard that delivers the interoperability and agility that these predictive enterprises need."
James Taylor, CEO, Decision Management Solutions, Co-author of "Smart (Enough) Systems: How to Deliver Competitive Advantage by Automating Hidden Decisions" (JTonEDM.com)
"'PMML in Action' may be destined to become an analog to the famous Kernighan and Ritchie book, 'The C Programming Language', published in 1978. This book (affectionately known as K&R) became the standard guide for ANSI C programming practice. I expect that 'PMML in Action' will function likewise in the burgeoning development of PMML in analytical tools now, and in the future. It is the 'cookbook' for PMML programming. Julia Child made French cuisine kiss-simple for housewives to create. Now, programmers can follow the descriptions and practices in this book to implement analytical solutions in PMML as easily and efficiently as Julia enabled a housewife to make a French soufflé."
Robert A. Nisbet, Ph.D., (Co-author of "Handbook of Statistical Analysis & Data Mining Applications")
PMML Links
We have compiled a list of useful PMML links below. Be sure to check them out if you would like to become a PMML pro.
- Book - PMML in Action: Unleashing the Power of Open Standards for Data Mining and Predictive Analytics.
- Data Mining Group Home
- PMML 4.0 Specification - supported by most vendors.
- PMML 4.1 Specification - released December 31, 2011.
- PMML page on Wikipedia
- IBM developerWorks Article - What is PMML? A great introduction to PMML.
- IBM developerWorks Article - Representing Predictive Solutions in PMML: Describes how data pre-processing and model are represented in PMML.
- Zementis Community Forums: Explore our PMML forums. Learn from the pros and share your experience.
- Predictive Analytics and PMML Blog: All things PMML.
- Zementis Blog: Issues and tips on how to export PMML from your favorite modeling tool.
- LinkedIn PMML Group: Join the PMML discussion group on LinkedIn.

