16-minute read • Based on data collected April 2024
While structuring a grouping of disparate vendors with varying functionalities is subjective, the ETR Observatory for ML/AI vendors categorizes by placement in the Observatory Scope primarily, which breaks down the data-driven plotting of each vendor into four vectors. It is important to remind our readers that all ETR Observatory reports are based solely on evaluator data and feedback, not vendor involvement. This report examines the below selection of ML/AI vendors by triangulating data from ETR’s Market Array for ML/AI Tools, Technology Spending Intentions Survey (TSIS), Emerging Technology Survey (ETS), commentary from ETR Insights Interviews with IT decision makers (ITDMs) from the ETR Community, and industry analysis by our research staff. TSIS data measures spending velocity on a vendor or product based on ETR’s proprietary Net Score and Pervasion measures. ETR Insights interviews provide qualitative context and vendor evaluation to complement the data.
This report focuses on the following vendors: Amazon (SageMaker) | Anaconda | Anthropic (Claude) | C3.ai | Cohere | Databricks | Dataiku | DataRobot | Google (Vertex AI) | H2O.ai | Hugging Face | IBM Watson | Jasper | Meta Llama | Microsoft (Azure Machine Learning) | OpenAI (ChatGPT) | Oracle | TensorFlow
The ML/AI sector today tracks vendors targeting organizations in the full spectrum of maturity with regard to data science programs. Organizations with more mature programs largely built data science teams that embraced a variety of tools, most certainly open-source programming languages like R and Python and open-source tools like Jupyter Notebook and TensorFlow. But as programs evolve, reproducibility and reuse of ML models become important, as well as all of the governance and data integration activities and monitoring that are part of end-to-end MLOps frameworks. Organizations that have more recently begun their data science journeys may leap-frog these phases given a variety of turnkey, business user-friendly solutions from today’s vendors focused on “democratizing” data science. No matter the stage of maturity, however, there is no denying the impact of generative AI’s recent emergence in the zeitgeist. It seems most organizations are now at least exploring possible use cases for ML/AI, and vendors in a variety of sectors – from security to RPA to enterprise applications – are weaving AI capabilities into their core product offerings.
The ML/AI market reflects this diversity of maturity and use cases, with classes of vendors that speak to the needs of different organizations. Large public cloud platforms like Microsoft, AWS, and Google offer full MLOps capabilities for enterprise-scale data science programs, competing alongside other popular end-to-end offerings like Oracle and IBM, and open-source packages like TensorFlow and Anaconda. Generative AI-focused offerings like Meta Llama, Anthropic’s Claude, and OpenAI’s ChatGPT offer large language models (LLMs) to help accelerate organizations’ generative AI use cases. Another class of tools aims to simplify the complexity of data science work by offering business-user-friendly products like pre-trained ML models to speed time to business value, such as DataRobot, C3.ai, H2O.ai, and Hugging Face. Still, others have broadened their appeal as ML/AI platforms by focusing earlier in the data and analytics pipeline, such as Databricks with its popular data lakehouse paradigm for data management or Dataiku with its finesse as a data preparation tool. Acknowledging Snowflake’s growing presence in this space, ETR intends to begin tracking the company in our ML/AI sector beginning with the next TSIS period. Snowflake’s positioning in this market will be included in future reports. Across these many ML/AI vendors, we see varying levels of spending and utilization across enterprises, with the more robust MLOps offerings and the generative AI-focused products occupying leading positions.
Spending Intentions
Figure 2. ETR’s Market Array spending Net Scores for ML/AI vendors were derived from a survey of 300 ML/AI users and evaluators.
Figure 2 above shows Market Array Net Score for vendors within the ML/AI marketplace, tracking forward-looking spending trajectory for each vendor’s ML/AI-specific offerings. This differs from ETR’s TSIS, which tracks overall spending projections at the company- and sector-wide levels. The data visualized in this figure will be referenced throughout this Observatory report.
Microsoft Azure Machine Learning holds the highest Net Score in this subsector at 86.1%, driven by the highest portion of Increase spending indications in the survey and a healthy Adoption rate of 16%, with zero users reporting plans to Replace the vendor in the next year. After Microsoft, three generative AI-focused vendors have the next highest Net Scores: OpenAI (84.3%), Anthropic (75.0%), and Meta Llama (70.6%). Cohere, Amazon SageMaker, Google (Vertex AI), Databricks, and Hugging Face follow after these top four in Net Score. Each of these vendors show minimal churn indications combined with elevated positive spending plans.
Toward the middle of the pack are vendors with sizable flat spending indications but still positive spending indications and minimal negative indications. The Net Scores for H2O.ai (53.3%), Dataiku (50.0%), Jasper (50.0%), and TensorFlow (45.6%) are robust and healthy.
More negative spending plans combined with sizable flat indications and fewer positive spending plans explain lower Net Scores for vendors like DataRobot (41.7%), IBM Watson (40.6%), C3.ai (31.3%), Oracle (20.4%), and Anaconda (18.8%). Though these vendors have the lowest Net Scores relative to their subsector peers in this survey, the Net Scores for DataRobot, IBM Watson, and C3.ai are elevated and in line with spending plans in many other sectors. This speaks to the general health of the ML/AI market at this moment, where many organizations are pursuing investment in these technologies.
Most Innovative
Figure 3. ETR’s Market Array tracks the “Most Desired” and “Most Innovative” vendors. The above depicts a small section of the most innovative vendors. The full analysis is available via the Market Array data set.
Expected Churn
Figure 4. This chart visualizes the Market Array data for ETR’s Expected Churn metric, which measures how long customers expect to use a product. The full analysis is available via the Market Array data set.
Usage Change
Figure 5. Regardless of spending intentions, ETR’s Usage Change analysis measures organizational changes in utilization levels of products. The full analysis is available via the Market Array data set.
As with any enterprise IT tool market, ML/AI tools have evolved to fit the needs of a variety of organizations that are all maturing their advanced analytics initiatives at different rates. What is clear, however, is that the early stages of exploration in ML/AI have, for many organizations – and especially larger enterprises – reached a point of widespread enterprise-scale adoption, requiring robust end-to-end data architectures and sophisticated management and governance programs. Smaller organizations or those just now dipping their toes in the waters of data science are finding early value in free, open-source offerings or tools with pre-built, pre-trained ML models ready to accelerate time to business value. With the broad awareness of generative AI technology in the last year, vendors and organizations alike are scrambling to imagine new possibilities with AI to leverage the power of LLMs.
To respond to these organizational stages, the ML/AI vendor landscape is dotted with big players offering full MLOps capabilities for mature data science operations, with large public cloud players like Microsoft, AWS, and Google leading the way alongside open-source stalwarts TensorFlow and Anaconda and popular open libraries and languages like Jupyter Notebook, Python, and R. Other vendors have built reputations for strength in particular areas, like Databricks and Dataiku, which offer sophisticated data management setups that feed data science use cases as well as more line-of-business-focused reporting and self-service business intelligence use cases. A separate set of vendors aim for democratizing data science, with off-the-shelf, turnkey ML solutions that are alluring to business executives and less technically savvy business users.