You've read about AI product recommendations. You've seen the case studies. Somewhere in the back of your mind, you know your store should probably be using smarter product discovery than whatever Shopify ships by default.
But then you look at your catalog. Half your products are tagged "summer" from two seasons ago. Your collections are a mess of manual overrides. Product descriptions range from carefully written paragraphs to a single bullet point that says "soft fabric." And you think: I need to clean all of this up before AI can do anything useful with it.
So you don't install anything. You add "fix product tags" to your Q3 to-do list, right between "redo email flows" and "finally learn GA4." And six months later, nothing has changed.
This is one of the most common reasons merchants delay AI-powered recommendations and search. The "garbage in, garbage out" principle makes intuitive sense. If your data is messy, how can an AI model produce good results?
The answer depends entirely on what kind of data the AI is actually reading.
The tag myth
When most merchants think about "catalog data," they think about the fields they maintain manually: product tags, collection assignments, metafields, vendor labels. And they're right that these fields are often a disaster. The average Shopify store has duplicate tags, inconsistent naming conventions, and collections that haven't been audited in over a year.
The assumption is that AI recommendation engines work like filters. You tag a product "blue dress," the AI reads that tag, and it recommends other things tagged "blue dress." Under that model, bad tags produce bad recommendations.
Some older recommendation tools do work this way. They parse your tag structure and use rule-based matching to surface related products. If your tags are inconsistent, the matching breaks down.
But modern AI models don't work like that at all.
What AI actually reads
Modern AI recommendation engines pull from three data sources that have nothing to do with your tagging structure.
Product descriptions and titles. Natural language processing reads your product copy the way a human would. If your description says "lightweight linen blazer, relaxed fit, ideal for warm-weather weddings," the AI understands the material, the fit, the use case, and the occasion without needing a single tag. Even a mediocre product description contains more semantic information than a perfectly organized tag structure.
Product images. Computer vision models analyze what's actually in the photo. Color, pattern, style, category, even the setting and context of the shot. A model trained on your catalog can identify that two products share a similar aesthetic from the images alone, regardless of whether you've tagged them identically (or at all).
Customer behavior data. This is the signal that matters most, and it requires zero catalog cleanup from you. Every time a customer browses a product, adds something to cart, or completes a purchase, the AI learns relationships between products. If 200 customers who bought Product A also bought Product B, those products are related. No tag needed. This behavioral data accumulates automatically from the moment a recommendation engine goes live on your store.
PersonalizerAI, for example, builds custom models for each merchant using all three signals. The AI reads your product descriptions and images using Google Gemini and Anthropic Claude, then layers in real-time behavioral data from your store's actual traffic. Tags and metafields are a bonus input when they exist, but they're not the foundation.
The cost of waiting
The irony of the "clean up first, install later" approach is that it costs you money while producing no benefit.
Every day without AI-powered recommendations is a day your visitors see generic product suggestions (or none at all). For a store doing $30K/month with 15,000 monthly sessions, even a conservative 12% lift in AOV from personalized recommendations represents about $3,600/month in additional revenue. That's $3,600/month you're leaving on the table while you reorganize tags that the AI doesn't need.
The behavioral data point matters here too. AI recommendation models improve over time as they process more customer interactions. Installing three months from now means three months of customer behavior data you'll never get back. The model that goes live in March will be smarter by June than the model that goes live in June will be by September, because it had a head start on learning from your traffic.
The AI learns your customers by watching how they browse your store. It needs traffic, not taxonomy.
What actually matters (the short list)
This doesn't mean catalog quality is irrelevant. There are a few things worth checking, and none of them require a multi-week cleanup project.
Product descriptions that say something real. They don't need to be long. They need to describe the product. "Blue widget" gives the AI almost nothing. "Ceramic desk organizer, 6-inch, matte navy finish, three compartments" gives it material, size, color, function, and category. If your descriptions are mostly blank or one-word, spending an afternoon filling in the worst offenders will improve AI performance noticeably.
Decent product images. If you're selling on Shopify, you almost certainly have product photos. That's enough. The AI doesn't need studio-quality lifestyle shots. It needs to see the product clearly. If you have products with no images or only placeholder thumbnails, those are worth fixing regardless of AI.
Active traffic. The AI needs visitors to learn from. If you're getting under 500 monthly sessions, the behavioral models won't have enough data to personalize effectively. At that traffic level, focus on acquisition first. Above 1,000 sessions/month, most AI recommendation engines can start producing meaningful results within the first two weeks.
Products that are actually in stock. This one is obvious but easy to forget. AI models will recommend out-of-stock products if your inventory sync is broken. Make sure your Shopify inventory status is accurate so the AI can exclude items that aren't available.
Four things, none of which involve reorganizing your tag structure.
Stop optimizing the wrong thing
Merchants who get the most from AI recommendations install early and iterate based on real performance data. They check which recommendation types produce the most clicks, test different widget placements, and let the AI's learning loop do the work on product matching.
Meanwhile, the merchants still reorganizing tags in a spreadsheet are planning a catalog overhaul that never quite happens, convincing themselves that AI will "work better" once everything is perfect.
Your catalog, your tags, your metafields can all stay exactly as messy as they are right now. The AI reads deeper than any of it.
If you want to see what AI recommendations would actually look like on your store with your current catalog data, PersonalizerAI offers a free trial with a 30-minute setup. No cleanup required.
.jpg&w=3840&q=75)