All posts
Ray Iyer
Ray Iyer
Co-founder, Anglera

Cutting returns in office supplies with better product data

Why office supplies get returned so often, the toner cartridge questions shoppers actually ask, and a checklist to close the gaps before checkout.

Cutting returns in office supplies with better product data

Office supplies look like a low-drama category: a box of pens, a case of paper, a toner cartridge. But it's a compatibility-driven category, which means the product page carries more weight than the product itself. Get one spec wrong and the shopper doesn't get a slightly-off item, they get one that physically doesn't work in their printer. Here's how to find and fix the gaps before they become a return.

Why office supplies returns are a data problem, not a shipping problem

Across ecommerce, roughly a fifth of everything sold online comes back, and one of the biggest drivers is shoppers receiving something that doesn't match what the listing implied, not a damaged or defective item (Digital Commerce 360, Velou). Office supplies has its own version of that problem, and it's sharper than apparel's "looked different in person." In office supplies, the return usually comes down to one missing or wrong fact: the wrong OEM number, an unlisted printer model, a page yield that wasn't actually tested the way the listing implied.

Office supplies ecommerce is also bigger than it looks. The category's top online retailers alone did nearly $8 billion in sales, with average order value around $190 — meaning a lot of orders are multi-item business restocks, not single impulse buys (Digital Commerce 360). When one line item in a restock order is wrong, the buyer doesn't return one SKU quietly. They question the whole order, and sometimes the whole vendor relationship.

The toner cartridge test case

Toner is the sharpest example because it's the category where "close enough" doesn't exist. A cartridge either fits the printer or it doesn't; it either hits its rated yield or the buyer feels shorted.

Here's a typical raw feed for a toner SKU, versus what a shopper (or an AI shopping agent) actually needs to make a confident, return-proof purchase.

AttributeRaw feed (as-received from supplier)Enriched (fixed for shoppers + AI agents)
Title"Toner Cartridge Black""HP 58A Black Standard Yield Toner Cartridge (CF258A)"
Compatible printers(blank)HP LaserJet Pro M404, M405, MFP M428, MFP M429 series
Page yield"high yield"3,000 pages at 5% coverage, per ISO/IEC 19752
OEM vs. compatibleNot specifiedGenuine HP OEM (not remanufactured or compatible-brand)
Cartridge type"toner"Standard-yield, non-high-yield (separate SKU exists for CF258X high-yield, 3,000 vs. 9,000 pages)
Chip / region lockNot specifiedNo regional chip lock; works across US/CA units
Pack quantity"1"Single cartridge (not a combo pack — separate SKU for 2-pack)

Page yield is where the industry itself admits ambiguity. Manufacturers rate yield under ISO/IEC 19752 (toner) or 24711 (inkjet), a standardized 5%-coverage print test, but real-world yield swings with page content, job size, and how much of a page is covered (LD Products). If a listing just says "high yield" without a page number and the test standard, a shopper has no way to compare it to a competing SKU, and a business buyer restocking for an office of 40 people has no way to budget for it. That ambiguity turns into a return the moment the cartridge runs out sooner than expected.

The standard-yield-versus-high-yield split deserves its own line, too. HP alone sells separate SKUs (like the CF258A and CF258X pair above) for the same cartridge family at roughly 3x the page count difference. If that distinction isn't a clean, filterable attribute, shoppers guess, and roughly a third of them guess wrong in the direction of "return and reorder."

The seven questions every office supplies page needs to answer

Before a toner cartridge, a label roll, or a shredder ever gets an "add to cart," the page needs to answer:

  1. What exact device does this fit? Full model list, not just a brand name.
  2. What's the OEM part number, and is this OEM, compatible, or remanufactured? These are different products at different price points and different reliability expectations.
  3. What's the rated yield, and under what standard? A number with no test method is a guess dressed as a spec.
  4. Is this the standard or high-yield version? If both exist, cross-link them so the shopper picks intentionally.
  5. What's in the box? Single unit, multi-pack, starter cartridge (often lower-yield than the retail version).
  6. Are there regional or chip restrictions? Relevant for cartridges and increasingly for smart-lock supplies.
  7. What's the expected shelf life / use-by window? Toner and ink both degrade; unopened age matters for bulk business buyers.

Miss any of these and you've handed the shopper a coin flip. Miss more than one and you've handed an AI shopping agent a reason to skip your listing entirely — tools like ChatGPT, Google AI Mode, and Perplexity are increasingly the layer that filters "compatible with my printer" before a human ever sees your page, and they can't recommend what they can't verify.

Ask an AI to recommend a toner cartridge for an HP LaserJet Pro M404dn, and the model isn't checking the box art. It's parsing whatever compatibility and yield data your feed actually exposes. If the fields are blank, your product doesn't get considered, competitor's does.

Fixing this at scale

Manually filling these seven fields for one SKU is a five-minute task. Doing it across a catalog of thousands of consumables, refills, and printer accessories is a data-maintenance job that never ends, because manufacturers add SKUs and change part numbers continuously.

That's the layer Anglera runs on top of your existing PIM or catalog. It doesn't replace where your data lives; it continuously scores every product page against gaps like the ones above, gap-fills the missing compatibility, yield, and OEM fields, and keeps them current as new cartridge SKUs launch. The result is a catalog that answers the shopper's real question the first time, and reads cleanly to the AI agents now doing a growing share of the pre-purchase filtering.

Ray Iyer

About the author

Ray IyerCo-founder, Anglera

Ray is a co-founder of Anglera, building the product-data infrastructure for agentic commerce — turning messy catalogs into structured, AI-readable data that buyers and answer engines can find. Previously product at Uber; Stanford CS.

See it on your own SKUs.

A 30-minute walkthrough on your categories and your supplier data.

Book a demo