The News
Starburst announced its participation in the Open Semantic Interchange (OSI), an open-source initiative led by Snowflake and ecosystem partners to standardize semantic metadata definitions across data tools, platforms, BI systems, and AI workflows. OSI aims to create a vendor-neutral semantic model specification that unifies business metrics and improves interoperability across dashboards, notebooks, and machine learning pipelines.
Analysis
The Industry’s Push Toward Semantic Consistency Gains Momentum
As enterprises accelerate AI adoption, the fragmentation of data definitions has emerged as a critical inhibitor to scale. theCUBE Research and ECI’s Day 0 and Day 2 findings illustrate the scope of the challenge:
- 70.4% of organizations cite AI/ML as a top spending priority, yet inconsistent data definitions slow downstream adoption.
- 59.4% identify automation and AIOps as required to accelerate operations, showing the need for machine-readable, standardized semantics to power automated pipelines.
- Organizations operate across an average of 3–5 cloud providers, making semantic interoperability indispensable for distributed data architectures.
This rising complexity is pushing enterprises (and developers) to prioritize open standards that ensure semantics travel consistently across tools, environments, and AI systems.
How OSI Shapes the Modern Data and AI Landscape
The Open Semantic Interchange aims to reduce semantic drift across the application lifecycle by introducing a vendor-neutral specification for business metrics and definitions. For the developer community, this matters because AI workflows increasingly depend on shared, interoperable metadata across pipelines.
Starburst joining OSI reinforces the industry’s shift toward open, standardized, and interoperable data ecosystems. With Trino, Iceberg, and a federated architecture, Starburst’s participation signals alignment with a broader movement to reduce format lock-in and simplify AI application development.
In practice, OSI may help create more portable semantic layers, allowing developers to move metric definitions across BI tools, semantic layers, and ML/AI systems without rework. While results will vary case by case, the initiative opens the door to more consistent cross-platform logic and faster adoption of AI-driven analytics.
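To make the portability idea concrete, here is a minimal sketch of what a vendor-neutral metric definition could look like. The OSI specification is still taking shape, so every field name and structure below is an assumption for illustration, not the actual spec:

```python
# Hypothetical sketch: one vendor-neutral metric definition rendered to SQL.
# Field names here are illustrative assumptions, not the OSI specification.
from dataclasses import dataclass, field


@dataclass
class MetricDefinition:
    name: str
    expression: str        # an aggregation over source columns
    source_table: str
    filters: list[str] = field(default_factory=list)


def to_sql(metric: MetricDefinition) -> str:
    """Render the metric as a SQL query; the same definition could feed
    a BI dashboard, a notebook, or an ML feature pipeline."""
    where = f" WHERE {' AND '.join(metric.filters)}" if metric.filters else ""
    return (f"SELECT {metric.expression} AS {metric.name} "
            f"FROM {metric.source_table}{where}")


revenue = MetricDefinition(
    name="net_revenue",
    expression="SUM(amount)",
    source_table="orders",
    filters=["status = 'completed'"],
)

print(to_sql(revenue))
# SELECT SUM(amount) AS net_revenue FROM orders WHERE status = 'completed'
```

The point of the sketch: the metric is defined once, and each consuming tool derives its own query from it instead of re-encoding the business logic.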
Current Market Challenges and Insights
Today’s enterprise data stacks face a convergence of pressures that make semantic standardization increasingly critical:
Semantic Fragmentation Slows AI Adoption
Most enterprises maintain multiple BI tools, data catalogs, notebooks, and ML workflows, each with its own definitions for metrics such as revenue, conversion rate, or compliance KPIs. The result is conflicting definitions across tools, added reconciliation effort, higher governance costs, and difficulty applying AI models consistently.
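The reconciliation problem is easy to see in miniature. In this toy example (the data and definitions are invented for illustration), two tools read the same rows but disagree on what "revenue" means:

```python
# Illustrative only: two tools compute "revenue" from the same rows but
# disagree because each embeds its own definition of the metric.
orders = [
    {"amount": 100, "status": "completed", "refunded": 0},
    {"amount": 50,  "status": "completed", "refunded": 50},
    {"amount": 75,  "status": "pending",   "refunded": 0},
]

# BI dashboard's definition: gross revenue on completed orders only
bi_revenue = sum(o["amount"] for o in orders if o["status"] == "completed")

# ML pipeline's definition: net of refunds, across all statuses
ml_revenue = sum(o["amount"] - o["refunded"] for o in orders)

print(bi_revenue, ml_revenue)  # 150 175 -- same data, two different "revenues"
```

A shared semantic specification would force both consumers to resolve "revenue" to one agreed definition before any numbers reach a dashboard or model.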
ECI Day 0 findings show 89.3% of organizations maintain a centralized API repository, but semantics remain dispersed, revealing a persistent gap between data access and business meaning.
Data Teams Are Overwhelmed by Tool Sprawl
Day 2 data shows 29% of organizations use 16–20 observability tools, further highlighting ecosystem fragmentation. This sprawl multiplies semantic inconsistencies and complicates cross-platform lineage, governance, and AI model training.
AI Demands Standardized Business Context
Developers building AI-native applications require standardized semantic layers to power:
- Feature engineering
- Data product generation
- LLM grounding
- Business metric validation
- Agentic workflows
With 80.5% of organizations already using AI for performance optimization, downstream needs for consistent metadata are becoming unavoidable.
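One of the listed needs, LLM grounding, can be sketched simply: inject the shared metric definition into the prompt so an agent works from the enterprise-wide meaning rather than guessing. The dictionary layout below mimics a semantic layer but is a hypothetical stand-in, not an OSI schema:

```python
# Hypothetical sketch of LLM grounding with a shared semantic layer.
# The dictionary structure is an illustrative assumption, not an OSI schema.
semantic_layer = {
    "net_revenue": {
        "description": "Sum of completed order amounts, net of refunds",
        "expression": "SUM(amount - refunded)",
        "grain": "daily",
    }
}


def grounded_prompt(question: str, metric: str) -> str:
    """Prefix a user question with the canonical metric definition so the
    model reasons over the agreed business meaning."""
    d = semantic_layer[metric]
    return (
        f"Metric '{metric}': {d['description']} "
        f"(expression: {d['expression']}, grain: {d['grain']}).\n"
        f"Question: {question}"
    )


print(grounded_prompt("Why did net_revenue dip last week?", "net_revenue"))
```

Because every agent reads the same definition, two AI workflows asked the same question are at least starting from the same business context.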
Multi-Cloud Architectures Increase Semantic Divergence
With 54.4% of organizations operating hybrid environments and multi-cloud use growing, definitions tend to drift as data moves between systems. This increases operational risk, slows governance, and adds friction to AI model deployment.
Together, these market pressures explain why open, cross-platform semantic standards are becoming a central requirement instead of a nice-to-have.
How OSI May Influence Developers Going Forward
Developer outcomes depend on adoption, tooling maturity, and ecosystem participation, but OSI could influence workflows in several ways:
- More consistent metric logic across tools, reducing the need to duplicate definitions in BI dashboards, notebooks, ML pipelines, and semantic layers.
- Improved interoperability when integrating distributed systems, data lakes, lakehouses, and AI services.
- Clearer governance patterns, allowing semantic definitions to be versioned, reviewed, and maintained like code.
- Simplified onboarding as new applications and AI agents can understand enterprise-wide metrics through shared schemas or definitions.
- Faster iteration on AI/ML models as standardized semantics improve feature consistency and reduce data drift risks.
Developers should expect to see more cross-tool compatibility, increased semantic portability, and more consistent logic across data products as OSI evolves.
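The "versioned, reviewed, and maintained like code" pattern above can be sketched as a validation check of the kind that might run in CI. The required fields and rules here are assumptions for illustration, not OSI requirements:

```python
# Hypothetical CI-style check: lint a versioned metric definition the way
# code is reviewed and tested. Required fields are illustrative assumptions,
# not the OSI specification.
REQUIRED_FIELDS = {"name", "expression", "source_table", "owner"}


def validate_metric(definition: dict) -> list[str]:
    """Return a list of problems; an empty list means the definition passes."""
    problems = [f"missing field: {f}"
                for f in sorted(REQUIRED_FIELDS - definition.keys())]
    expr = definition.get("expression", "")
    if "SELECT" in expr.upper():
        problems.append("expression should be an aggregation, not a full query")
    return problems


metric = {
    "name": "net_revenue",
    "expression": "SUM(amount - refunded)",
    "source_table": "orders",
    "owner": "finance-data-team",
}

print(validate_metric(metric))  # [] -- definition passes review
```

Treating definitions this way lets teams apply familiar workflows, such as pull requests, reviews, and automated checks, to business semantics rather than only to application code.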
Looking Ahead
The market is clearly pivoting toward unified semantic layers as a prerequisite for enterprise AI. Standards like OSI have the potential to reduce complexity, streamline interoperability, and make it easier for organizations to leverage AI across multi-cloud environments. As more vendors adopt open specifications, developers may gain a more predictable and consistent foundation for building analytics and AI applications.
Starburst’s participation reinforces its commitment to openness in the data ecosystem. By investing in community-driven specifications and vendor-neutral interoperability, it positions itself to support enterprises pursuing federated, AI-ready architectures. What comes next will likely include deeper integrations across BI tools, shared semantic repositories, and expanding support across cloud providers, all key enablers for the next generation of data-driven and AI-native applications.
