All posts
Ray Iyer
Ray Iyer
Co-founder, Anglera

Getting footwear products recommended by ChatGPT, Gemini, and AI shopping

Footwear shoppers now ask ChatGPT and Gemini for recommendations before a search engine. Here's why thin product data keeps brands invisible to AI.

Getting footwear products recommended by ChatGPT, Gemini, and AI shopping

A shopper looking for a trail running shoe no longer starts with a search bar full of blue links. They ask ChatGPT, Gemini, or Google's AI Mode a full sentence: what's the best option, for this foot, this budget, this use case. The engine answers with a short list, and most footwear brands never make that list, not because the product is wrong for the shopper, but because the data behind it never told the AI enough to include it.

The channel shift is not hypothetical anymore

AI-driven traffic to U.S. retail sites rose 393% year over year in Q1 2026, and shoppers arriving from AI sources are converting 42% better than traffic from any other channel, a full reversal from a year earlier when AI traffic converted worse than everyone else, according to Adobe's Q1 2026 traffic report covered by TechCrunch. Adobe's own writeup is blunter about the cause: most retail sites still aren't built to be read by machines, which is a data problem before it's a marketing problem.

OpenAI has been building the retrieval side of this to match. Its product discovery documentation describes ChatGPT parsing structured feeds and evaluating specific attributes against a shopper's query, then ranking matches, not paging through product descriptions written for a human eye. Instant Checkout, live since September 2025, lets a shopper buy the shoe without leaving the chat, which means the moment your product gets recommended is closer to the moment it gets purchased than it has ever been in ecommerce.

Google's Shopping Graph, meanwhile, now indexes more than 60 billion product listings with roughly 2 billion updates per hour, a scale confirmed at Google I/O 2026 and detailed in an explainer on what the graph actually reads from a feed. It reads GTIN, brand, MPN, title, price, availability, and variant attributes like size, color, material, and pattern grouped by item_group_id. Feed freshness matters more here than in classic Shopping ads, because AI Mode prefers live inventory data over a nightly batch file.

Why footwear specifically gets punished for thin data

Footwear is one of the least forgiving categories for sparse attributes. A single style can spawn dozens of SKUs across size, width, and colorway, and an AI answer engine has to resolve all of that before it can safely recommend the product. Google requires color, size, age group, gender, size system, and size type for every apparel and footwear listing, and per Google Merchant Center's structured data guidance, missing any one of them can get the item disapproved outright, not just deprioritized.

That is the mechanism worth internalizing: an AI shopping agent is not being generous or stingy with your brand. It is pattern-matching a shopper's question against fields it can trust. If the fields are missing, the product is functionally invisible, no matter how good the shoe is.

Here is what that gap looks like in practice, using a typical trail runner feed pulled straight from a supplier file versus the same product enriched for AI legibility.

AttributeRaw supplier feedAI-ready enrichment
TitleTrail Runner Shoe - MensMen's Waterproof Trail Running Shoe, Wide Fit, Vibram Outsole
GTINmissing00887350000123
Widthnot specifiedStandard (D), Wide (2E), Extra Wide (4E) — sold separately by SKU
Drop / stack heightnot specified8mm drop, 32mm stack height
Use case"for running"Technical trail, loose rock, wet conditions
Weightnot specified10.8 oz (size 9, single shoe)
Waterproofingnot specifiedYes, breathable membrane, not fully submersible

Ask an AI to recommend "a waterproof wide-width trail running shoe under $150 for someone who overpronates," and the enriched row above gives the model six separate fields to match against; the raw row gives it none. One of these products can be recommended with confidence. The other cannot be recommended at all, regardless of how good it actually is on the trail.

What machine-readable footwear data actually requires

A few things separate a catalog an AI agent can use from one it skips past:

  • A valid, unique GTIN per size and colorway variant, not a single UPC shared across a style. Google is explicit that GTINs are one of its strongest matching signals for grouping and comparing offers.
  • Full variant attributes on every SKU: size, width, size system, color, material, and item_group_id so the AI can tell a men's 10.5 wide from a women's 9 standard without guessing.
  • Fit and use-case language a shopper would actually type: overpronation support, drop, stack height, wide-toe-box, waterproof rating, terrain type. These are the phrases that show up in the conversational queries answer engines are built to match.
  • Freshness: stock status and price that reflect current inventory, since AI Mode and ChatGPT both favor live data over stale exports.
  • Consistency across every surface: the same spec on your PDP, your feed, and your schema markup. Contradictions between them read as untrustworthy data to a model, per the same Shopping Graph mechanics above.

None of this requires ripping out a PIM or a feed pipeline. It requires someone, or something, continuously checking every SKU against that list and filling the gaps before an AI agent ever sees the row.

That's the layer Anglera runs on top of whatever system already holds your footwear catalog, PIM, spreadsheet, or none of the above. It scores every product against the attributes AI shopping agents actually parse, flags what's missing or inconsistent, and enriches the gaps so the wide-width waterproof trail runner your customer is asking for shows up as an answer instead of a miss.

Ray Iyer

About the author

Ray IyerCo-founder, Anglera

Ray is a co-founder of Anglera, building the product-data infrastructure for agentic commerce — turning messy catalogs into structured, AI-readable data that buyers and answer engines can find. Previously product at Uber; Stanford CS.

See it on your own SKUs.

A 30-minute walkthrough on your categories and your supplier data.

Book a demo