Walmart: ChatGPT checkout converted 3x worse than website

(searchengineland.com)

77 points | by speckx 3 days ago

14 comments

janalsncm 1 hour ago
This is from one of the links in the article
> Why this is happening. Two forces are slowing agentic commerce, according to Leigh McKenzie, director of online visibility at Semrush: infrastructure and trust. Real-time catalog normalization across tens of millions of SKUs is a decade-scale problem Google already solved with Merchant Center, and consumers still default to checkout flows they trust — Apple Pay, Google Wallet, and Amazon one-click.
It turns out when you step outside of “hard tech” problems like building GPT6 there are all of these details others have solved already. E-commerce has been optimized to the last decimal point for the last 30 years.
OpenAI is new to it, and if I had to guess, not that interested in getting good at it.
[-]
- petcat 1 hour ago
  > not that interested in getting good at it
  I think they're interested in getting good at it. They just don't want to put in the human time and effort to do so. They expect their many failures and short-comings to be shored up by continuous model training.
  But that, of course, means that in the meantime it will suck and nobody will use it.
  [-]
  - TeMPOraL 25 minutes ago
    I think they're operating beyond their current (human) capacity, trying to test out too many things at a time.
    But a dreamer in me entertains another idea: perhaps they're just holding back, because they realize that actually succeeding at this will instantly kill (or at least mortally wound) e-commerce as we know it.
    (This is a more narrow version of my belief that general AI tools like LLMs fundamentally don't fit as additions to products, but rather subsume products, and this makes them an existential threat to the software industry. Not to software or computing, just to all the software vendors, whose job is to slice off pieces of computational universe, put them in boxes to prevent interoperability, and give each a name so it's a "product" that can be sold or rented).
    [-]
    - latexr 10 minutes ago
      > But a dreamer in me entertains another idea: perhaps they're just holding back, because they realize that actually succeeding at this will instantly kill (or at least mortally wound) e-commerce as we know it.
      Sam Altman doesn’t give a shit about anyone but himself and has time and again shown he has no restraint for trampling over others to further his own goals. Why would e-commerce be where he draws the line?
    - NicoJuicy 10 minutes ago
      > perhaps they're just holding back
      Considering the money they need, they over promise and under deliver.
- rvz 30 minutes ago
  Also having to wait for ChatGPT for a "thinking" response to search for information that is slower than a Google search loses them lots of money.
  I believe that it can still work and I won't claim about being unsurprised about this failure. But this is a great opportunity to execute this problem really well if OpenAI and others are not interested in getting good at this.
  Perplexity also attempted this, got sued by Amazon and it appears semi-abandoned.
  The only problem is that it must be quicker or just as quick as a Google search, and also compatible with the existing checkout flows.
  [-]
  - TeMPOraL 11 minutes ago
    > Perplexity also attempted this, got sued by Amazon and it appears semi-abandoned.
    Any details on that? I feel the answer is more likely there than in "friction".
    Hardly any purchase of consequence is so sensitive to friction that the difference between Google Search and an LLM response matters (especially that in reality, we're talking 20+ manual searches per one LLM response). I.e. I'm not going to use LLMs advise on some random 0-100$ purchase anyway, and losing #$ on a ##$ purchase due to suboptimal choice is not that big of a deal - but I absolutely am going to consult it (and have it compile tables and verify sources) on a $500+ purchase and for those I can afford spending few more minutes on research (or rather few hours less, compared of doing it the usual way).
__alexs 1 hour ago
(Good) E-commerce has been ruthlessly optimised to get shoppers to products they'll actually buy and then remove all distractions from buying.
A chat interface is just fundamentally incompatible with this. The agent makes it too easy to ask questions and comparison shop.
[-]
- itopaloglu83 1 hour ago
  They practically want to funnel users like cattle and not let them think about it or compare things.
  It’s like corporations are angry that they need to go through us to get our money.
  [-]
  - amelius 25 minutes ago
    Users already use the internet to compare things. It makes no sense to bet on them not doing that.
  - ares623 34 minutes ago
    You can sit on your couch all day for 30 days and corporations will still be able to take your money. The marvels of frictionless payments.
- bko 1 hour ago
  Why is this good? I want an impartial consistent system for shopping. If I can find it at a different site for a lower price, I should be able to do so. I should also be able to have it give me non-bot reviews and ask relevant questions about the product.
  The same way I think shopping at Amazon is better than a place like Nike due to objectivity and comparison, I think a chat interface has the potential to take this to another level since places like Amazon have degraded considerably in terms of things like fake third party products and fake reviews.
  [-]
  - __alexs 21 minutes ago
    The buyer of this technology is not shoppers, it's retailers. The measurement of quality is "does it make us more money?" not "does it help me make better buying choices."
    Retailers do not want you to make better choices. They want you to buy the widget.
    A lot of evidence suggests that also shoppers aren't that interested in making the best choice either. They want to make a tolerable choice with as little effort as possible. There is no basically no consumer market for "power shopping" outside of weird niches like pcpartpicker.com etc.
  - throwaway290 37 minutes ago
    > impartial consistent system for shopping
    > for a lower price
    Catalog is impartial, chatbot is ads pretending as advice.
    [-]
    - danlitt 16 minutes ago
      I do agree with your conclusion, but the catalog in most online shops is certainly not impartial. Amazon sells the entire first page of search placement, for example.
    - TeMPOraL 23 minutes ago
      Catalog is an ad, the SKU database behind the catalog is impartial (at least as much as it gets), but no one is giving you access to that.
- TeMPOraL 45 minutes ago
  > (Good) E-commerce has been ruthlessly optimised to get shoppers to products they'll actually buy and then remove all distractions from buying.
  The only e-commerce site that fits this standard is that old one for buying (IIRC) nuts and bolts or such, that pops up on HN every other year, and whose name sadly escapes me now. Everyone else is ruthlessly optimizing their experience to fuck shoppers over and get them to products the vendor wants them to buy, not the products the shoppers actually want (or need).
  > A chat interface is just fundamentally incompatible with this. The agent makes it too easy to ask questions and comparison shop.
  That is precisely the point.
  Chats may suck as an interface, but majority of the value and promise of end-user automation (and more than half the point of the term "User Agent" (as in, e.g., a web browser)) is in enabling comparison shopping in spite of the merchants, and more generally, helping people reduce information asymmetry that's intertwined with wealth and power asymmetry.
  But it's not something you can generally sell to the vendors, who benefit from that asymmetry relative to their clients (in fact, I was dumbfounded to see so much interest on the sales/vendor side for such ideas, but I blame it on general AI hype).
  Adversarial interoperability is the name of the game.
  [-]
  - froggit 13 minutes ago
    Not sure you're aware but you initially sound like you disagree with the post you replied to, only to follow up by enthusiastically reiterating that author's words as if in agreement.
    You realize what shoppers and vendors each consider to be "good" e-commerce sites are fundamentally opposed concepts?
    [-]
    - TeMPOraL 8 minutes ago
      Maybe? I'm not sure which way the OP is arguing, in particular because of that "(Good)". So perhaps I misread the comment as arguing the opposite of what it is.
  - busymom0 41 minutes ago
    https://www.mcmaster.com/
    [-]
    - bandrami 11 minutes ago
      Just made an order from them. It's weirdly comforting to know there's a company that knows I need clevis bolts and is willing to sell them to me for a transparent price.
    - jaapz 12 minutes ago
      Wow, that's amazing
    - TeMPOraL 22 minutes ago
      It's the one, thank you!
- exegete 1 hour ago
  The article says Walmart is not abandoning ChatGPT but is going to use their own app in Chat’s ecosystem
- locknitpicker 53 minutes ago
  > (Good) E-commerce has been ruthlessly optimised to get shoppers to products they'll actually buy and then remove all distractions from buying.
  I don't think so. I know for a fact that search terms are a minefield of gotchas and hacks caused by product decisions that reflect ad-hoc negotiations with partners and sellers. It's an unstable equilibrium of partners trying to shift attention to their products in a certain way. I think that calling this fragile equilibrium optimized has no bearing with reality.
- TZubiri 1 hour ago
  It depends on the product, if we are talking commodities or mass produced products like groceries, sure.
  If we are talking custom products or complex appliances that need a lot of guidance, then maybe chat interface is appropriate.
  [-]
  - bashkiddie 1 hour ago
    When I shop for special hardware (e.g. bicycle shift gear) it is usually underspecified. If the information does not exist in the text block, a chat bot is of no use.
    [-]
    - TeMPOraL 18 minutes ago
      Chat bots don't belong to an e-commerce site; chat bots belong on the outside, specifically to comparison-shop and pull in some external information to de-bullshitify offers, correct "mistakes" and "accidental omissions" in the listings, resolve the borderline-fraudlent crap companies play these days with store-specific and season/promotion-specific SKUs with different parameters all resolving to same model/make name (think Black Friday/Cyber Monday deals that are not actually deals, just inferior hardware with dedicated SKU).
    - TZubiri 22 minutes ago
      The use case for chat interfaces would be as follows:
      Grandma wants to buy a good bike, but doesn't know about types of wheels or how many gears they need, or what type of frame is appropriate for their body type.
      [-]
      - TeMPOraL 16 minutes ago
        Reliable information on this does not exist on vendor sites, though. It exists on Reddit and in books and in med/physio papers and bunch of other places a SOTA model has read in training or can (for now) access via web search.
        LLMs are already very good for shopping, but only as long as they sit on the outside.
  - pjc50 1 hour ago
    The problem is that the chat transcript is legally binding. If the chatbot makes incorrect statements which the customer relies on for their complex purchase, then you're going to have to refund them.
    https://www.bbc.co.uk/travel/article/20240222-air-canada-cha...
Lerc 1 hour ago
Is their issue that ChatGPT served their customers more than it served them?
[-]
- Vespasian 1 hour ago
  To early to say that but it's certainly a part of the equation all vendors are currently looking at.
  And given the past few decades there is no reason to not try to do that.
- d--b 58 minutes ago
  That's definitely it.
  This probably means that OpenAI et al are fine-tuning salesman-like LLMs to "fix" that problem.
  Can't wait for the future.
qoez 25 minutes ago
It's probably stuff like this along with investor pressure that will make AI companies slowly make their AIs more profit maximizing (and the long term reason ilya etc was so against even going down that path)
keiferski 46 minutes ago
I don’t trust AI bots to access my wallet. Not sure I ever will.
I sort of trust them to make product recommendations, but at best I will only open a link they suggest and buy the product there.
[-]
- falcor84 22 minutes ago
  > my wallet
  Does it actually need direct access to your wallet? I haven't tried it yet, but assumed it would work with a separate wallet, fed through by top-ups.
  [-]
  - keiferski 18 minutes ago
    Sounds like they are an intermediary between the user and the business. So it’s not a top-up thing. That would be needlessly complex from ChatGPT’s end, too.
    https://chatgpt.com/merchants/
- everdrive 43 minutes ago
  >I sort of trust them to make product recommendations
  Never, ever.
  [-]
  - raincole 36 minutes ago
    Your argument is "they're designed to influence us" right?
    Amazon reviews are paid influence. Reddit posts are paid influence. Everything everywhere you read online is paid influence. I'd rank LLMs between "people I personally trust" and "random people online."
    [-]
    - vulcan01 18 minutes ago
      Since LLMs are trained on "random people online", why are they not of equal rank?
  - keiferski 39 minutes ago
    I trust them as much as any other online source, which is to say, sort of, but only as a starting point for research.
    Do you have a better alternative?
  - crooked-v 34 minutes ago
    Even if they're (somehow) bias-free, you're still stuck with "the state of the internet circa 20XX" from the training data.
hexasquid 1 hour ago
Last year they couldn't draw fingers on hands properly, this year they can't convert at checkouts, I wonder what they'll be failing to do a year from now.
[-]
- bashkiddie 44 minutes ago
  Next years headline will be
  > AI accounts for 90% of accidents while only accounting for 1% of traffic
  [-]
  - stavros 23 minutes ago
    Waymo cars cause 90% fewer accidents than human drivers:
    https://abit.ee/en/cars/waymo-robotaxi-autonomous-driving-sa...
firefoxd 1 hour ago
The experience is a lot like when you are talking with a friend, then they decide to ask siri or google a question using voice. The result is always imprecise. Meaning they either have to repeat their query, or end up typing it anyway.
If you want to buy a Walmart product, the easiest way is to go to Walmart. Why add an imprecise middle man in between?
brador 11 minutes ago
Original AI was sourced from university level text books, stack and wikipedia. Average iq of writer around 140. The latest AI is trained on the average citizens social media output. Iq 90.
That’s why AI seemed smart. The bar will not be raised again. We’re cooked.
zmmmmm 45 minutes ago
> Walmart will embed its own chatbot, Sparky, inside ChatGPT. Users will log into Walmart, sync carts across platforms, and complete purchases within Walmart’s system.
The enshittification is upon us.
[-]
- netsharc 26 minutes ago
  Hah, Clippy's cousin Sparky: every once in a while after ChatGPT answers a question it'll say "Looks like you still have stuff in your WalMart cart. Would you like me to complete that checkout for you? Also, WalMart-brand diapers is on offer this week, shall I add that to your cart?"
kvisner 3 days ago
That doesn't seem terribly surprising, a human can quickly look through a grid of shirts to find one they like. ChatGPT would be guessing what they might want and the human would probably get a bad experience there with some regularity.
fennecfoxy 54 minutes ago
Not really many details...
Perhaps clickthrough is worse because there are fewer dark patterns involved and people are mostly just browsing and occasionally buying only what they need.
They didn't really seem to specify the "why" of it with any research. And weird that OAI wasn't supporting them to see wha the issue was.
casey2 1 hour ago
Because most people can't read. Wait for agents generating personalized websites/self checkout apps.
charcircuit 1 hour ago
What if they made instant checkout actually instant? You ask ChatGPT to setup a website and it instantly purchases web hosting and sets up the website there. You can't beat a 100% conversion rate by actually checking out instantly. If you didn't like that host you can ask it to find it alternative and ChatGPT would automatically attempt a refund and then purchase from someone else.
[-]
- LoganDark 1 hour ago
  I'd become afraid to ask that bot anything, not knowing what it would automatically try to purchase for me. Paying for the bot itself is already a lot: $20/mo to $200/mo or more.
  [-]
  - lukan 37 minutes ago
    20$ for a personal assistant is not much, but no, I surely don't trust those assistants to access my money.
- TZubiri 1 hour ago
  What are we talking about here? ChatGPT as an interface to buy groceries? Or ChatGPT as an interface to build a website. I fail to see how these would be related other than using a specific technology
  [-]
  - wrqvrwvq 11 minutes ago
    > Because most people can't read. Wait for agents generating personalized websites/self checkout apps.
    > You ask ChatGPT to setup a website and it instantly purchases web hosting and sets up the website
    Multiple comments deflecting from the original shopping conversion failure to recommend ... building a whole new website (with hosting for some reason?). W/o bothering to look through commenter history, one has to assume there are a lot of chatbots on this site or else the people using this stuff have been lobotomized.
    I'm sure it'll start happening too, and when that fails, the bots will, i don't know, invent a new macarena. We are definitely headed for an irredeemably stupid future.