All posts
Ray Iyer
Ray Iyer
Co-founder, Anglera

What messy product data actually costs Pet Supplies retailers

Thin pet product catalogs cost more than lost sales. Here's what messy data does to search, conversion, and AI shopping visibility in 2026.

What messy product data actually costs Pet Supplies retailers

Pet supplies retail is in a strange spot: the category is growing fast, more of it than ever is happening online, and most catalogs still can't answer basic questions like "will this fit a 70-pound dog" or "is this grain-free." That gap is not cosmetic. It shows up in search rankings, cart abandonment, return rates, and now, whether an AI shopping agent recommends the product at all.

The pet catalog is more complicated than it looks

A single SKU in pet supplies carries more decision-critical attributes than most categories realize. Life stage (puppy, adult, senior), breed size suitability, protein source, ingredient exclusions (grain-free, poultry-free), coat type or hairball formulas for cats, aquarium water chemistry compatibility, bird seed blend ratios. None of this is optional trivia. It is what a shopper — or an AI agent shopping on their behalf — filters on before they'll even look at price.

Most PIMs and supplier feeds were not built to carry that depth. A distributor feed typically arrives with a title, a brand, a category, and maybe a weight. Life stage, breed size, and ingredient flags get left to manual entry, and manual entry does not scale across tens of thousands of variants (a 30-lb bag, a 15-lb bag, a 5-lb bag, three flavors, two formulas).

The category is also large enough that gaps compound. The U.S. pet industry hit $158 billion in 2025, and pet food and treats alone accounted for $68.3 billion of that. Online is no longer the minority channel: 53% of pet parents now purchase products online versus 45% in-store, and pet food e-commerce sales grew 45.7% between 2020 and 2025. A thin catalog isn't a rounding error on a shrinking channel. It's a growing hole in your biggest one.

What thin data actually costs

The costs show up in three places, and they compound on each other.

Where it breaksWhat happensWhy it costs money
Search and filteringShopper filters "large breed," "grain-free," "senior" — your SKU is missing the attribute, so it doesn't surfaceLost impressions before the shopper ever sees a price
ConversionProduct page is missing feeding guidelines, ingredient list, or size-fit guidanceShopper leaves to check another retailer or Amazon, where the same SKU has a fuller listing
Returns and supportShopper guesses on breed size, harness fit, or aquarium tank compatibility and gets it wrongReturn shipping, restocking, and a support ticket eat the margin on that order

None of these are dramatic single failures. They're small leaks, repeated across a catalog with thousands of SKUs and dozens of attributes per SKU, every day. That's what makes messy pet data expensive: not one broken listing, but a systemic gap that touches most of the catalog most of the time.

Before and after: a bag of dog food

Here's what a typical distributor feed looks like next to what the same product needs to actually compete for search, filters, and AI recommendations.

AttributeRaw distributor feedEnriched
Title"Dog Food Chicken 30lb""Large Breed Adult Dry Dog Food, Chicken & Rice, 30 lb Bag"
Life stage(missing)Adult
Breed size(missing)Large breed (50+ lb)
Protein sourceChickenChicken (first ingredient), no poultry by-product
Grain status(missing)Contains grain (rice, oats)
Feeding guideline(missing)3-4 cups/day for 60-90 lb adult dog
Special diet flags(missing)Not grain-free; not suitable for poultry-allergic dogs

The raw version answers "what is this." The enriched version answers "is this right for my dog" — which is the actual question being asked, whether it's typed into a search filter or asked of an AI agent.

Ask an AI to recommend a large-breed senior dog food, grain-free

Type that request into ChatGPT, Gemini, or Perplexity today and watch what happens: the assistant reasons over life stage, breed size, and ingredient exclusions simultaneously, then only surfaces products whose data explicitly confirms all three. A product with a great formula but a missing "grain-free" flag or absent breed-size attribute doesn't get excluded on merit. It gets excluded because the agent can't verify it.

This is the part of 2025-2026 that raises the stakes past "better search rankings." OpenAI's Instant Checkout, live since September 2025 and running the Agentic Commerce Protocol, requires retailers to push machine-readable product feeds directly to the platform — schema, availability, and pricing all included (OpenAI). Google's Universal Commerce Protocol, announced for AI Mode and Gemini in early 2026, follows the same logic. Pages with clean structured data are already cited roughly 3.1x more often in AI Overviews than pages without it. A missing attribute used to cost you a filter click. Now it can cost you being considered at all.

Marketplace pressure compounds it. Chewy, Amazon, and Petco all compete on the same SKUs your site sells, and their listings tend to carry deeper attribute sets because they've invested in enrichment at scale. When a shopper (human or AI) compares a thin listing to a full one for the identical bag of food, the fuller listing wins the click, the conversion, and increasingly, the AI recommendation.

Fixing this without touching the PIM

None of this requires ripping out a PIM or building an enrichment team from scratch. Your PIM stores the data; Anglera does the work of finding what's missing — life stage, breed size, ingredient flags, feeding guidance — and filling it in at the SKU and variant level, continuously, as new products land in the feed. It plugs into whatever system already holds your catalog, no migration required, and scores every listing for the kind of completeness that both shoppers and AI shopping agents are now filtering on.

Ray Iyer

About the author

Ray IyerCo-founder, Anglera

Ray is a co-founder of Anglera, building the product-data infrastructure for agentic commerce — turning messy catalogs into structured, AI-readable data that buyers and answer engines can find. Previously product at Uber; Stanford CS.

See it on your own SKUs.

A 30-minute walkthrough on your categories and your supplier data.

Book a demo