All posts
Ray Iyer
Ray Iyer
Co-founder, Anglera

What your on-site search logs reveal about catalog gaps

Your on-site search logs already show which attributes are missing. Here's how to read zero-results, filter gaps, and exit rate as an enrichment queue.

What your on-site search logs reveal about catalog gaps

Most retailers treat on-site search as a UX widget to tune, not a data-quality dashboard to read. That's a mistake. Every zero-result query, every filter a shopper tries and can't use, every search-result page someone abandons is a shopper telling you, in their own words, what your catalog is missing. Search users convert at roughly 2-3x the rate of browsers — industry benchmarks put search conversion around 4-6% against a 1-3% site average — which means these logs aren't a minor UX report. They're your highest-intent traffic telling you exactly where the catalog fails them.

The three signals that matter

Site search platforms already expose these numbers. The problem is almost nobody routes them to the merchandising or content team as a to-do list.

SignalWhat it showsWhere to find it
Zero-results rateQueries with no matching products — vocabulary gaps, missing synonyms, or products that genuinely lack the attribute a shopper searched forSearch analytics platform (Algolia, Bloomreach, Klevu) or GA4 event view_search_results filtered to zero results
Searches with no filter coverageShoppers who search, then try to filter by an attribute (size, material, voltage, compatibility) that isn't populated on enough SKUs to be a usable facetFaceted nav analytics or a query against your PIM/catalog export: attribute fill rate per category
Search-exit rateShoppers who search, land on a results page, and leave without clicking a productSearch analytics session flow, or GA4 exploration comparing view_search_results to next-page or select_item events

Industry data on zero-results is consistent enough to plan against: a typical catalog runs 10-15% zero-result queries, well-run search engines get that under 5%, and top performers push toward 2%. Every point above that floor is either a synonym problem your search vendor can patch, or a coverage problem only enrichment can fix — and you need to know which is which before you spend engineering time on the wrong one.

Reading zero-results as a coverage audit, not a search-tuning task

Pull your top 100-200 zero-result queries for the last 30-90 days and split them into two buckets:

Vocabulary gaps — the product exists, but the query used different words than your title, description, or attribute values ("15mm socket" vs. "15 millimeter socket," "waterproof" vs. IP67). These are synonym-dictionary fixes on the search platform side.

Coverage gaps — no product in the catalog carries the attribute value being searched, or the value exists in a supplier spec sheet but was never extracted into a structured, searchable field. This is the larger bucket in most catalogs, and it's an enrichment problem, not a search-relevance problem. A shopper searching "food-safe stainless" or "5-year warranty" is describing an attribute your PIM may have room for but doesn't actually have populated at scale.

The split matters because Baymard Institute's research on no-results pages found roughly a third of sites still treat a zero-result page as a dead end, with no recovery path — and a static suggestion box on the results page can't fix a query that's asking for an attribute value that genuinely doesn't exist anywhere in the catalog. You can't merchandise your way out of a data gap.

Filter usage tells you which attributes to prioritize

Faceted filters are a second, quieter signal. If shoppers repeatedly select a filter and it collapses their result set to near-zero, or if a facet you'd expect to be well-used (like "compatible with" or "certification") barely gets clicked because it's greyed out or sparsely populated, that's a direct read on attribute fill rate by category — ranked by actual buyer demand, not by whichever fields happen to be easiest to fill first. This is a better prioritization signal than guessing which attributes "should" matter, because it's built from real search behavior instead of assumption.

Cross-reference filter engagement against your attribute completeness report (most PIMs, from Akeneo to Salsify to Pimcore, can export fill rate by attribute and category). Where filter demand is high and fill rate is low, that's the enrichment queue, ranked by revenue opportunity rather than gut feel.

Search-exit rate closes the loop

Zero-results and filter gaps tell you what's missing before the click. Search-exit rate tells you what's missing after it — a shopper found something, clicked in, and still didn't buy. Compare PDP-level bounce and exit rates for traffic arriving from on-site search against traffic arriving from category browse. A meaningfully higher exit rate from search traffic usually means the result matched the query on title alone but the PDP itself doesn't answer the question the search implied — missing spec, no compatibility info, no size chart, an incomplete image set. Search sent a high-intent shopper to the right page; the page itself didn't close the sale.

Turning this into a measurement loop

  1. Pull top zero-result and low-CTR queries monthly, tag each as vocabulary or coverage.
  2. Cross-reference filter usage with attribute fill rate by category to rank enrichment priority.
  3. After each enrichment pass, re-measure zero-results rate, search-driven conversion, and PDP exit rate on the affected categories, not just sitewide averages — sitewide numbers dilute the signal from the SKUs you actually touched.
  4. Track the downstream metrics too: fewer zero-result and support-ticket-driving queries usually shows up as lower return rates and lower support load, since the same missing attributes that block a search often show up later as a "why doesn't this fit" ticket or a return reason code.

On-site search logs are a live, high-intent feedback channel on catalog completeness — arguably a better one than organic search or AI-referral traffic, because it's your own buyers, on your own site, describing exactly what they can't find. Anglera reads that gap the same way: it scores attribute completeness against your live catalog, extracts and fills the missing values from supplier and source documentation, and pushes them back into whatever PIM you run — so the fixes show up where your search logs said they needed to be, without a rebuild.

Ray Iyer

About the author

Ray IyerCo-founder, Anglera

Ray is a co-founder of Anglera, building the product-data infrastructure for agentic commerce — turning messy catalogs into structured, AI-readable data that buyers and answer engines can find. Previously product at Uber; Stanford CS.

See it on your own SKUs.

A 30-minute walkthrough on your categories and your supplier data.

Book a demo