Machine-readable commercial infrastructure: markup, product feeds, and catalogs as a language AI can understand

Research question

Why the catalog, markup, and product data feed are no longer a secondary technical add-on, but the language through which the brand speaks to AI about its commercial reality.

Evidence type

Google Search Central documentation on ecommerce and structured data, and OpenAI documentation on product feeds and shopping scenarios.

Freshness of factual claims

The factual basis of the article was updated as of March 2026.

The editorial and machine sides of the catalog

Many companies still talk about visibility in AI as if it were a purely editorial task. The assumption is that they simply need to write better copy, articulate their advantages more clearly, and present the product page more neatly. All of that does matter. But for commercial presence in the answer environment, good writing alone is no longer enough. Answer systems read not only paragraphs; they read structures. They care about price, availability, shipping, returns, product variants, seller identity, the relationship between the organization and the offer, and the degree to which those data are current. And the more of the user’s decisions happen before visiting the site, the more important machine-readable commercial infrastructure becomes—that layer of formalized information a machine can interpret without prolonged guesswork.

Google says this with almost no diplomacy. In its ecommerce documentation, the company recommends combining structured data on the site with a product data feed in Merchant Center rather than relying on only one of those channels [1]. Structured data improves the accuracy of how price, discounts, shipping, and availability are understood, while Merchant Center gives the company more control over assortment and update timing, especially if the catalog is large and changes often [1]. In effect, Google is acknowledging that human-written text and machine-readable product discipline are now two different, but equally important, layers of the brand’s commercial entity.

It is important here to avoid a common misconception. Google says two things at once that, at first glance, seem contradictory: there is no special “AI markup” for AI Overviews and AI Mode, yet structured data, textual accessibility of key content, and Merchant Center freshness still matter critically [9]. In reality, there is no contradiction. Platforms are not waiting for some magical new tag “for AI”; they are waiting for commercial information to be described in the formal structures that search and shopping systems already know how to read. In other words, machine-readable infrastructure is powerful precisely because it is not a hack. It is a discipline of clarity, not a trick for getting around the rules.

For an AI answer, this is critical. Human-written text is good at explaining meaning: what makes a product different, who it was created for, and what jobs it solves. But when a user asks, “how much does it cost,” “is it in stock,” “what are the return terms,” “when will it arrive,” or “which version is right for a particular scenario,” the system should not have to pull that information from free-form description every time like an archaeologist working through loose soil. It needs reliably marked anchor fields. That is exactly why Google’s documentation separately introduces schemas for shipping terms, return policy, product variants, and the store’s organizational context [2][3][4][5]. These elements may look dry at first glance, but they are in fact the language in which the brand speaks to the machine about commercial reality.

What makes a commercial offer machine-readable

From a practical standpoint, commercial machine readability consists of six blocks. The first block is seller identity. Who exactly is selling the product or service? How are the brand, the legal entity, the site, organization profiles, and the catalog connected? The second block is the identity of the product itself: name, model, variant, variant group, attributes, and category membership. The third block is the offer itself: price, currency, availability, discount, and item condition. The fourth is logistics: shipping, delivery timing, geography, and inventory status. The fifth is post-purchase policy: returns, exchanges, and warranty. The sixth is freshness: when all of this was last updated and through which source contour the platform received the signal that something had changed. If even two or three of these blocks are described vaguely, the system is forced to reconstruct the missing pieces from indirect clues. And where the machine has to guess, the brand loses control.

In recent years, Google has been developing exactly this view in a very consistent way. The company separately recommends an organization-level description for return policies so that the same long constructions do not have to be repeated on every product card, while also increasing the chances that the information will appear in brand profiles and knowledge panels [4]. Support for product variant groups is likewise framed as a special class of entity that makes it possible to display variants of the same model more accurately [5]. This leads to an important conclusion: machine-readable infrastructure is needed not only for individual product cards, but also for the brand’s overall commercial self-description.

In 2026, OpenAI effectively arrives at the same conclusions, only within its own shopping scenarios. In the Agentic Commerce documentation, the company offers a structured product feed so that ChatGPT can “accurately index and display” products with current price and availability [6]. The shopping documentation states that merchant ranking depends on metadata about the product and the seller—including availability, price, quality, whether the seller is the manufacturer or the primary seller, and whether Instant Checkout is enabled [7]. What is especially interesting here is that the commercial logic of the answer stops being purely textual. A brand enters the new selection not only because it is well described in prose, but because its data are fit for machine comparison.

This radically changes the meaning of the familiar work around a catalog. In the past, the catalog was often treated as a technical appendage to “real” marketing. Now it becomes part of the brand’s argumentation. If the catalog lacks a clear structure for product variants, AI may assemble the product line incorrectly. If shipping and return terms are not presented anywhere in machine-readable form, the system will understand purchase risk less well. If price and availability are updated slowly, the brand loses not in text, but in the speed of truth. And if product data are stored in several unsynchronized places, what the machine receives is not a brand, but a contradiction.

In this context, it is especially instructive that both Google and OpenAI emphasize the synchronization problem. Google warns that using both the site and Merchant Center at the same time can lead to conflicts and lags, so it is useful to enable automatic item updates, especially when price or availability changes frequently [1]. OpenAI writes directly that there may be a delay when prices and shipping terms are updated, and that a direct feed is needed precisely to reduce the gap between the catalog’s actual state and what ChatGPT shows [7]. In other words, the central task here is not simply to “mark up the site,” but to build a coordinated data flow between the brand’s internal systems and external answer platforms.

Synchronization as a hidden discipline

For ai100, this topic is especially valuable because it connects the marketing conversation to the company’s engineering reality. Visibility in AI stops being a matter only for editors and organic traffic specialists. It now includes product information systems, catalog update processes, data quality control, feed synchronization, card architecture, and the discipline of the corporate entity registry. In management language, that means one thing: commercial visibility in the answer environment becomes a function not only of content, but of the business’s operational maturity.

From a research perspective, this opens an almost boundless field. One can compare brands within the same category by the completeness of their machine-readable description, examine which elements most often correlate with correct display in AI scenarios, measure how the presence of structured data and feeds affects the accuracy of pricing and logistics answers, and track which types of errors arise most often when an organization-level return policy or product groups are missing. Such a body of data would be useful not only as a media asset, but also as the foundation for a consulting product.

The engineering contour as a condition of marketing

But the main conclusion lies deeper than the technical layer. Machine-readable commercial infrastructure is not a way to please the algorithm. It is a way to make the brand’s business entity clear enough that the machine does not distort it on the way to the user. The market is gradually moving away from a world in which the user went to the site and manually assembled a picture from scattered tabs. In the new world, part of that assembly is performed by an intermediary. If the brand has not given that intermediary a precise language for describing its offer, the intermediary will inevitably start filling in the gaps. And where that gap-filling begins, controllability ends.

That is why the best commercial brand of the next few years will win not only through product quality and sharp positioning, but through the engineering of its own truth. Whoever can express assortment, price, terms, and the seller’s role in machine-readable form gives AI less room for error and more grounds for confident display. In the answer environment, this is no longer a secondary technical detail. It is a new grammar of trust.

What seems well established

It is well established that both Google and OpenAI are moving toward a more structured way of representing commercial data. Without machine-readable infrastructure, a brand risks being understood only partially or in an outdated way.

What still remains uncertain

Less certain is which combinations of fields and structures will prove most advantageous in each vertical, and how quickly markets will develop stable best practices for this new commercial layer.

What this changes in practice

The practical value here is that the catalog needs to be designed as part of the brand’s semantic infrastructure. It is no longer a secondary technical appendage, but a condition for an accurate commercial answer.

Sources

[1] Google Search Central. Share your product data with Google. 2025-2026

[2] Google Search Central. Structured data for shipping details. 2025-2026

[3] Google Search Central. Structured data for return policies. 2025-2026

[4] Google Search Central Blog. Supporting organization-level return policies and loyalty programs. 2025

[5] Google Search Central Blog. New merchant listing support for product variants. 2024-2025

[6] OpenAI Developers. Agentic Commerce - Products. 2026

[7] OpenAI Help Center. Shopping with ChatGPT Search. 2026

[8] OpenAI Help Center. ChatGPT Release Notes - March 24, 2026 Shopping updates. 2026

Related materials

Next step

How technical infrastructure affects visibility

Catalogues, structured markup, and product feeds are a language machines understand. In AI100 report growth cases, the technical readiness of a site often turns out to be one of the levers for boosting visibility.

See growth cases in the sample report →