Your Product Page Looks Great to Humans. AI Agents Can't Read It

Share
Your Product Page Looks Great to Humans. AI Agents Can't Read It
Photo by Kind and Curious / Unsplash

Your product page is stunning. The banner images pop. The ingredient infographics explain everything. The Do's and Don'ts section is perfectly designed. There's just one problem: AI agents can't see any of it

55% of ecommerce sites fail to communicate the content inside their informational images to machines. And in a year where AI-driven orders on Shopify are up 15x and 73% of shoppers are using AI in their buying journey, that failure is costing real revenue.

Your Product Page Has Two Audiences Now

When a customer visits your Shopify store, they see a beautifully designed product page — hero images, lifestyle photography, ingredient infographics, FAQ banners, and trust badges. Every pixel is intentional.

But your product page doesn't just have human visitors anymore. It has two machine audiences that evaluate your products completely differently:

Web crawlers (ChatGPT, Perplexity, Google Gemini) visit your product page like a browser. They can read HTML text, follow links, and parse structured data. But they cannot read text embedded in images. A stunning banner that says "Made with Organic Amla and Henna" is just a rectangle of pixels to a crawler. Google is explicit about this: "Avoid embedding important text inside images, because not all users can access them."

Shopping agents (Shopify's agentic storefronts, OpenAI's ChatGPT Shopping, Google Shopping AI) use your store's structured API data — product titles, descriptions, metafields, variant data, and tags. They never visit your web page at all. They consume data feeds. Shopify's Catalog uses AI to categorize and standardize product data across billions of items — but it can only work with the structured data your store actually provides.

These two channels often see dramatically different versions of your product. And according to Adobe's research, the average product page scores just 66% for machine readability — meaning roughly a third of your product content is invisible to AI.

A Real Example: Bombay Shaving Company's Beard Colour

We ran a scan on Bombay Shaving Company's "Beard Colour (Natural Black)" using ZipShop's free AI readiness scanner. The results illustrate the problem perfectly.

Web Crawlers scored it 55/100. Shopping Agents scored it 95/100.

The product's Shopify API data is excellent — 95 out of 100. The merchant clearly invested in filling out product descriptions, metafields, and structured data. But the web page tells a completely different story.

Here's the dimension-by-dimension breakdown:

Dimension Web Crawlers Shopping Agents What's Happening
Identification 3/5 5/5 API has full structured product data; the web page buries identifiers in images
Ingredients & Formulation 4/5 5/5 Amla and Henna ingredients are in a banner image — crawlers can't read them
Usage Instructions 4/5 5/5 "Where to use" section (Beard, Moustache, Sideburns) is an infographic
Safety & Allergens 2/5 4/5 Do's & Don'ts with patch test warnings is entirely image-based
Variants & Pricing 3/5 5/5 API has clean variant data; the web page only shows partial pricing in HTML

The 40-point gap between channels isn't because the product data is bad. It's because excellent data is being trapped in images where one of the two channels — web crawlers — can't access it.

This is the image trap: pages that are information-rich for humans and information-poor for machines.

Why This Gap Is a Revenue Problem in 2026

A year ago, this was a minor SEO issue. Today, it's directly tied to revenue.

Agentic commerce is live on Shopify. AI agents are browsing stores, comparing products, and completing purchases on behalf of shoppers. The numbers have moved fast:

  • AI orders on Shopify up 15x since January 2025 (Shopify)
  • AI referral traffic to retail up 393% year-over-year in Q1 2026, converting 42% better than traditional channels (Adobe)
  • 73% of shoppers are already using AI in their buying journey (Riskified)
  • $3-5 trillion in global commerce projected to be mediated by AI agents by 2030 (McKinsey)
  • $300-500 billion in U.S. agentic commerce by 2030, roughly 15-25% of total online retail (Bain)
  • Traditional search volume falling 25% by end of 2026 due to AI alternatives (Gartner)

When a shopper asks an AI agent "Find me a natural beard colour that's ammonia-free and safe for sensitive skin," the agent needs to find that information in parseable text or structured data. If it's trapped in a banner image, the agent can't match your product to the query. Your competitor whose allergen info is in plain HTML text wins the recommendation.

The two channels diverge even further with Shopify's Universal Commerce Protocol. Shopping agents using UCP consume your product data through structured APIs — they never see your web page. If your API data is rich (like Bombay Shaving Company's), great. But if all your effort has gone into visual design while neglecting structured data, shopping agents see an empty shell.

The Five Most Common Image Traps

Based on scanning hundreds of Shopify stores and Baymard Institute's research showing 55% of ecommerce sites fail on informational image accessibility, these are the most common places merchants hide critical information inside images:

1. Ingredient and Material Infographics

The most common trap. Merchants create beautiful ingredient breakdowns — icons, illustrations, percentage bars — all as a single image. Crawlers see nothing. Baymard found that 52% of ecommerce sites don't include descriptive text alongside informational product images.

This is especially damaging for beauty, skincare, supplements, and food brands where ingredients drive purchase decisions. When an AI shopping agent compares products for a query like "vitamin C serum without parabens," it can only match products where that ingredient data exists as text.

Fix: Keep the visual infographic for humans, but also include the full ingredient list as HTML text. A collapsible section or a text block below the image works. Shopify metafields can store ingredient data that both the page template and the API can access — and those metafields are critical for AI agent recommendations.

2. Usage Instructions and How-To Sections

"How to Apply," "Where to Use," "Dosage Guide" — these sections are almost always designed as step-by-step infographics with numbered icons. AI agents need this information to answer questions like "How do I use this product?" or "Is this suitable for my beard type?"

ChatGPT processes over 2.5 billion prompts per day, with shopping-related queries growing 25% in the first half of 2025 alone. Many of those queries are specific usage questions that only get answered if the data is machine-readable.

Fix: Use a metafield for usage instructions. Render it as both a designed section (for visual appeal) and as structured HTML text (for crawlers). The metafield data also flows through to the API for shopping agents.

3. Safety and Allergen Warnings

Do's and Don'ts lists, patch test instructions, allergen callouts — these are legal and trust-building content that merchants naturally design as eye-catching graphics. But when an AI agent is asked "Is this product safe for sensitive skin?" and the answer exists only in an image, the agent either can't answer or recommends a competitor that has the data in text.

70% of consumers are at least somewhat comfortable with an AI agent making purchases on their behalf. Those agents need to verify safety claims from structured data, not image pixels.

Fix: Safety information should always be in HTML text. It's too important to hide in images. Use a dedicated metafield (e.g., safety_warnings or allergen_info) and render it as text on the page.

4. FAQ Sections as Designed Cards

Many stores style their FAQs with custom graphics — bubble designs, accordion-style cards with decorative elements, or full-width banner images for each Q&A pair. Shopify themes with native FAQ sections render these as HTML, but custom-designed ones often don't.

This matters because FAQ content is prime material for People Also Ask boxes and AI-generated answers. Google AI Overviews now appear on 14% of shopping queries, and informational shopping queries like "best [product]" trigger AI Overviews 83% of the time. HTML-formatted FAQs are eligible for these — image-based ones aren't.

Fix: Use Shopify's native FAQ page or a FAQ metafield that renders as HTML. The visual design can come from CSS, not from baking text into images. Bonus: HTML FAQs are eligible for Google's FAQ rich snippets and are directly readable by AI agents.

5. Trust Badges and Value Propositions

"Free Shipping," "30-Day Returns," "Cruelty-Free," "100% Organic" — these are often icon-based trust strips. While less critical than ingredients or safety info, they influence AI agent decisions about which products to recommend. Only 18% of ecommerce product pages have complete schema markup — which means these trust signals aren't reaching AI through structured data either.

Fix: Include key value propositions in your product description or tags. JSON-LD structured data can carry shipping and return policy information that agents read. Google explicitly recommends JSON-LD over other structured data formats.

The Machine Readability Gap Is Wider Than You Think

The image trap is part of a broader product data visibility problem. Adobe found that while the average U.S. retail homepage scores 75% for machine readability, individual product pages score just 66%. For stores with thousands of SKUs, that means a third of product content is invisible to AI.

WebAIM's 2026 analysis of the top 1 million websites found that 16.2% of all images still lack alt text entirely. One in four linked images (product banners, clickable infographics) is missing alt text — the exact content-rich images where merchants embed their most important information.

The problem compounds: a SALT.agency audit of 100 major ecommerce sites found that 33% of URLs didn't contain JSON-LD structured data at all. Combined with image-trapped text and missing alt text, the result is that AI agents are making recommendations based on a fraction of the actual product information that exists.

As Shopify's enterprise team puts it: "Most product data was built for humans browsing websites. There is a gap between what renders well for a human shopper and what an AI agent can actually consume."

How ZipShop Catches This

ZipShop's free AI readiness scanner evaluates your store from both channels simultaneously — exactly as AI agents experience it.

For each product, we generate two scores:

  • Web Crawler Score: What ChatGPT, Perplexity, and other web-crawling AIs can extract from your product page's HTML
  • Shopping Agent Score: What structured API data (titles, descriptions, metafields, variants) is available to agentic commerce platforms

When these scores diverge significantly — like Bombay Shaving Company's 55 vs. 95 — it's a clear signal that critical product information exists in your system but isn't reaching AI crawlers through your web page.

The scanner also checks nine brand-level signals that affect AI discovery: robots.txt configuration, llms.txt presence, JSON-LD schema richness, sitemap, social links, blog, policy pages, product title quality, and agentic storefront readiness.

The entire scan takes 30 seconds and requires no install. You get a detailed report with specific, actionable recommendations for each issue found.

The Fix Is Not Hard. It's Just Overlooked.

The encouraging part of this problem is that it's not a data problem — it's a presentation problem. Most merchants already have rich product data in their Shopify admin. The issue is that their theme templates render that data as images instead of text.

The fix usually comes down to three things:

1. Audit your product pages for image-trapped text. Right-click on your product page and "View Source." If key information (ingredients, usage, safety, FAQs) doesn't appear in the HTML, crawlers can't see it. Use the Shopify SEO checklist as your starting guide.

2. Use metafields for structured content. Shopify metafields store data that's accessible to both your web theme (for rendering) and the Storefront/Admin API (for shopping agents). Create metafields for ingredients, usage instructions, safety warnings, and FAQs. This is the single highest-leverage fix — it serves both channels from one data source.

3. Duplicate image content as HTML text. You don't have to choose between beautiful design and machine-readable content. Keep the infographics. But also include a text version — a collapsible section, a tab, or even a <div> that's visually de-emphasized but present in the HTML.

Your product page can be beautiful and machine-readable. It just takes knowing where the gaps are.

FAQ

Why can't AI agents read text in images?

AI web crawlers process HTML, not pixels. When you embed text inside an image (as an infographic, banner, or designed card), the crawler sees an <img> tag — not the words inside it. Unless you add alt text or duplicate the content as HTML, the information is invisible. Google explicitly recommends against embedding important text in images for this reason.

How do I know if my product pages have this problem?

The fastest way is to right-click your product page, select "View Source," and search for key product information (ingredients, safety warnings, usage instructions). If you can't find it in the HTML, crawlers can't either. For a comprehensive check, run a free AI readiness scan — it audits both your web page HTML and your structured API data, showing you exactly where the gaps are.

Does adding alt text to images fix the problem?

Partially. Alt text helps web crawlers understand what an image contains, and WebAIM reports that 16.2% of images still lack it entirely. But alt text is limited — you can't fit an entire ingredient list or FAQ section into an alt attribute. The better fix is to include the information as both a visual element (for humans) and HTML text (for machines).

What about AI vision — can't ChatGPT read images now?

ChatGPT and other LLMs do have vision capabilities in their chat interfaces, but their web crawlers and shopping agents still process pages primarily as text and structured data. Even when vision is available, it's less reliable than parsing clean HTML. Structured data boosts GPT-4 accuracy from 16% to 54% compared to unstructured content. Don't rely on AI vision to compensate for missing text.

Which product categories are most affected by the image trap?

Beauty, skincare, supplements, food and beverage, and health products are hit hardest because ingredients, safety warnings, and usage instructions are the primary purchase drivers — and they're almost always designed as infographics. Fashion and home goods are affected too, especially for size charts, care instructions, and material compositions.

Do Shopify metafields help with AI discovery?

Yes — metafields are the highest-leverage fix. They store structured data that's accessible to both your web theme (for rendering on the page) and Shopify's APIs (for shopping agents). When Shopify's Catalog syndicates your products to ChatGPT, Perplexity, and other AI platforms, it pulls from your structured data including metafields. Creating metafields for ingredients, usage instructions, and safety info serves both channels from one data source.

How big is the machine readability gap for ecommerce?

Adobe found that the average U.S. retail product page scores just 66% for machine readability — meaning roughly a third of content isn't optimized for AI. That's a significant gap considering AI referral traffic to retail grew 393% year-over-year in Q1 2026 and converts 42% better than traditional channels.

What's the difference between the image trap and missing structured data?

They're related but distinct problems. The image trap is about content that exists on the page but is embedded in images rather than HTML text — so web crawlers can't read it. Missing structured data is about content that isn't formatted as JSON-LD schema — so search engines and AI agents can't categorize it. A product page can have both problems simultaneously, and often does. Fixing both requires adding HTML text versions of image content AND implementing proper schema markup.

Does this affect my Google SEO too, or just AI discovery?

Both. Google has always recommended against embedding important text in images. Text in images can't be indexed for search, can't be translated, and isn't accessible to screen readers. The GEO vs SEO overlap here is significant — fixes for AI readability also improve traditional search performance. The difference is that in 2026, the cost of ignoring this problem is higher because AI is now a direct purchase channel, not just a discovery channel.

How quickly can I fix this?

For most Shopify stores, the core fixes (adding metafields, duplicating image content as HTML text, adding alt text) can be done in a weekend for your top 10-20 products. Start with your best sellers — those products get the most AI queries. Then work through the rest of your catalog over time. The Shopify SEO checklist breaks this into specific, actionable steps.


Want to see exactly what AI agents see when they look at your products? Run a free AI readiness scan → It takes 30 seconds, requires no install, and shows you every image trap and data gap in your store.