This report focuses on ML/AI platforms, with data on the following vendors:
Altair | Amazon (SageMaker) | Anaconda | C3 AI | Cloudera (AI) | Databricks (Mosaic AI) | Dataiku | DataRobot | Domino | Google (Vertex AI) | H2O.ai | Hugging Face | IBM (watsonx) | Microsoft (Azure Machine Learning) | Oracle (Machine Learning) | SAS | Snowflake (ML) | TensorFlow | Weights & Biases
While many organizations have long explored ML/AI for various business use cases, often starting with open-source programming languages like R and Python and open-source tools like Jupyter Notebook and TensorFlow, today, the most mature-and-established ML/AI applications leverage robust platforms to manage at an enterprise scale. As ML/AI programs evolve, reproducibility and reuse of ML models become important, as well as all of the governance and data integration activities and monitoring that are part of an end-to-end MLOps frameworks. There are some vendors, too, that offer turnkey, business user-friendly solutions focused on “democratizing” data science, allowing organizations to ramp up ML/AI efforts quickly. Generative AI also looms large, fully arriving in the zeitgeist with OpenAI’s public release of ChatGPT in late 2022. Thanks to these recent innovations in generative AI and large language models (LLMs), AI has become even more accessible, and its applicability even more apparent for a range of use cases. Most organizations are now exploring possible avenues for ML/AI, and vendors in a variety of sectors – from security to robotic process automation, to enterprise applications – are weaving AI capabilities into their core product offerings.
The ML/AI market reflects this diversity of maturity and use cases, with classes of vendors that speak to the needs of different organizations. Large public cloud platforms like Microsoft, AWS, and Google offer full MLOps capabilities for enterprise-scale data science programs, competing alongside other popular end-to-end offerings like Oracle and IBM, and open-source packages like TensorFlow and Anaconda. Another class of tools aims to simplify the complexity of data science work by offering business-user-friendly products like pre-trained ML models to speed time to business value, such as DataRobot, C3.ai, H2O.ai, and Hugging Face. Still, others have broadened their appeal as ML/AI platforms by focusing earlier in the data and analytics pipeline, such as Databricks with its popular data lakehouse paradigm for data management; Snowflake with its solid data warehousing foundation; and Dataiku with its finesse as a data preparation tool. Across enterprise, we see varied spending and utilization on these many ML/AI vendors, with the more robust MLOps offerings occupying leading positions.
It is important to note that the ML/AI market is a complicated and extremely fast-moving space. In the past two years, the generative AI boom has supercharged the ML/AI race with new generative AI use cases and LLMs. Foundational AI models have proliferated, and nearly every enterprise tech product across every domain has rolled out new AI features and capabilities. Certainly, too, the market is flooded with AI-focused branding, a reorientation around new interest in these cutting-edge technologies. ETR closely tracks these developments in generative AI and LLMs with our AI Product Series, a six-times-per-year survey tracking spending, utilization, and perceived value for dozens of AI tools and features, including foundational AI models and AI features embedded in broader products. Thus, foundational models and specific AI features embedded in other tools have been excluded from the present study. This Observatory for ML/AI instead focuses on more robust, end-to-end, complete platforms capable of orchestrating ML/AI programs at enterprise scale. Or, at the very least, contained tool packages that some organizations, especially small and midsize firms, may deploy to tackle complete ML/AI use cases. Although many open-source ML/AI tools are widely used in enterprises, they have been excluded from this study in favor of paid tools and products that have clearer sales motions as “freemium” offerings with significant spending intentions data.
The ETR Observatory for ML/AI surveyed 314 IT decision makers. Most (63%) represent Large enterprises of more than 1,200 employees, with more than a fifth (22%) at Fortune 500 firms and nearly a third (31%) at Global 2000 enterprises. The three most representative industry verticals are Services/Consulting, IT/TelCo, and Financials/Insurance, collectively comprising more than half (55%) of the sample. Almost three-quarters (72%) of respondents are in North America and 20% are in EMEA, with the remainder representing APAC (8%) and Latin American (<1%) regions. About half (51%) of respondents hold VP or Director-level titles, and the remainder are split between C-level roles (31%) and practitioner roles (18%).
As with any enterprise IT tool market, ML/AI tools have evolved to fit the needs of a variety of organizations that are all maturing their advanced analytics initiatives at different rates. What is clear, however, is that the early stages of exploration in ML/AI have, for many organizations – and especially larger enterprises – reached a point of widespread enterprise-scale adoption, requiring robust end-to-end data architectures and sophisticated management and governance programs. Smaller organizations or those just now dipping their toes in the waters of data science are finding early value in free, open-source offerings or tools with pre-built, pre-trained ML models ready to accelerate time to business value. With the broad awareness of generative AI technology in the last two years, too, vendors and organizations alike are scrambling to imagine new possibilities with AI to leverage the power of LLMs.
To respond to these organizational stages, the ML/AI vendor landscape is dotted with big players offering full MLOps capabilities for mature data science operations, with large public cloud players like Microsoft, AWS, and Google leading the way alongside open-source stalwarts TensorFlow and Anaconda and popular open libraries and languages like Jupyter Notebook, Python, and R. Other vendors have built reputations for strength in particular areas, like Databricks, Snowflake, and Dataiku, which offer sophisticated data management setups that feed data science use cases as well as more line-of-business-focused reporting and self-service business intelligence use cases. A separate set of vendors aim for democratizing data science, with off-the-shelf, turnkey ML solutions that are alluring to business executives and less technically savvy business users.