A blog post like this is half the story. I’d like to see the results. Did your brother get more business? What were the failure modes? Did customers care if it was a bot or not?
It also ignores easily available solutions that could have been deployed prior to AI
For example, even if it shows a boost of $100,000 per month in revenue. It could likely have been achieved with a shared virtual assistant / receptionist for about $200-1000 per month (depending on exactly call volumes).
So really, the revenue was already lost and going forward you’re just deciding to capture it. You've created a more complicated mouse trap than what was already available to you. The difference is saving a couple hundred dollars of labor less whatever your AI/tech costs are. I’d still go the human route because it’s more future proof and if this is a luxury service, human service is always going to feel more luxurious.
Could he not just increase the price until the number of calls matches the time he has?
I know it's not that simple, but my gut says theres value to at least hearing out the people taking action to call you. Especially if that's automated and low cost to you.
This is such an important point. My plumber that we always call is extremely busy and usually doesn't have availability for at least a week. He is a one man shop and prefers it that way. You call his phone, leave a voicemail and he calls you back whenever he is able to. I asked him if he wants to get more business by automating his incoming calls and he said "not really, I am already very busy and have enough business. I don't need these tools".
So we cannot always assume that the business owner (especially the solo mom and pops) wants more business. Good ones are already very busy.
This seems to be true with every trade shop in my area. HVAC, plumbing, electrical, landscaping, appliance repair, and so on: Nobody picks up the phone, and when you do get someone, they don't seem to be very interested in your job unless it sounds like big money to them. Everyone already apparently has as much work as they want, and if you're a small fish you're out of luck.
That's wild. Plumbing especially seems like a field where if you need a plumber you need them right now, not a week from now.
I guess as a plumber having enough of the type of jobs that can wait a week that you can turn away the urgent calls might be one of those feature-not-a-bug type situations.
I tried getting some work quotes not long ago and was surprised by how many local shops still don't have: (1) website that takes info and emails/calls back, (2) voicemail, or (3) having one or both of those and didn't call back all week. I suspected they had all the business they can handle. I did get a call back later in the week from one that said as much.
Why would a car repair shop need a website for? All I care about is the phone number, with the hope that someone will pick it up. IDK about the world, but in Poland every single mechanic I know has no downtime at all. The better ones have queues measured in weeks or months for simple repairs. They don't care about extra business, the business will find them anyway.
I needed to replace my car's windshield in a hurry while on an extended trip. I called around to see who might have one in stock that could do a rush order. There was one place that had an automated voice system, and I hung up because it kept redirecting the conversation to get me to hand over more information than necessary to answer my question.
If I were already an existing customer and just wanted to schedule an oil change, it'd be fine, though I'd probably just schedule on the website anyway. I'm really only going to call in if I have an unusual circumstance and actually need to speak with someone.
Automated voice systems that try to sound human but are in fact purely scripted are insanely annoying. E.g. "I think you said 'windshield', is that correct? ... Got it, thanks!"
If you only have 4 options, just give me the old school list of voice options and I'll press 1 through 4, in less time, and being only moderately annoyed.
But a knowledgeable AI system as described in the article - that knows what it knows and tells you when it doesn't - could work great. If it had access to inventory and calendar, it might have worked for you. The question is whether the implementation lives up to the high expectations set by the articles.
Me too, but I wonder whether we're in the minority here. I'm sure there must be plenty of people who just call places to get information easily found via the web, or there wouldn't be so many automated phone systems that explain how to get information via their website.
I know someone who works on the voice response system for $LARGEBANK. She says that more than 95% of calls are just to find out a checking account balance.
That's fine, and there's no need for AI pretending to be a human, or to ask me to talk to a computer as if it is a human. Routine decision trees work really well here.
In fact, decision trees are nice because they tell your more or less up front what they're capable of.
What really sucks (AI or decision tree, either way) is when they don't let you easily speak with someone.
I'd argue a well designed AI assistant would be considerably better than a decision tree for that use case. Decision trees are slow because you normally need to wait through several options before getting to the one you're interested in. (Though sure, perhaps not if your call is literally for the most common thing.) But with an AI you could jump straight to what you're interested in.
"Hi, I'm the LargeBank AI Assistant. How can I help you?"
"I'd like to know the balance of my checking account."
And then authenticate and get the balance as usual. Simpler and faster.
Agreed that it becomes a problem if it's seen as a replacement for human agents though. In an ideal world it would actually free up the human agents for when they're actually needed. In reality it'll probably be some of each.
I believe that. Probably 95% of my support calls to online shops are about order status (aka: the website shows "in preparation" for a week already, I need to talk to a real person).
Maybe I am in the minority here, but I appreciate the new crop of LLM based phone assistants. I recently switched to mint mobile and needed to do something that wasn't possible in their app. The LLM answered the call immediately, was able to understand me in natural conversation, and solved my problem. I was off the call in less than a minute. In the past I would have been on hold for 15-20 minutes and possibly had a support agent who didn't know how to solve my problem.
Also I bet the LLM didn't speak too fast, enunciate unclearly, have a busted and crackly headset obscuring every other word it said to you, or have an accent that you struggled to understand either.
I was on the wrong end of some (presumably) LLM powered support via ebay's chatbot earlier this week and it was a completely terrible experience. But that's because ebay haven't done a very good job, not because the idea of LLM-powered support is fundamentally flawed.
I had a similar situation with a chatbot: I posted a highly technical question, got a very fast reply with mostly correct data. Asked a follow-up question, got a precise reply. Asked to clarify something, got a human-written message (all lowercase, very short, so easy to distinguish from the previous LLM answers).
Unfortunately, the human behind it was not technically-savvy enough to clarify a point, so I had to either accept the LLM response, or quit trying. But at least it saved me the time from trying to explain to a level 1 support person that I knew exactly what I was asking about.
My big question is. Why has the company and their development process failed so horribly they need to use LLM instead the app? Surely app could implement everything LLM can too.
Amazon support does this pretty well with their chat. The agent can pull all the relevant order details before the ticket hits a human in the loop, who appears to just be a sanity check to approve a refund or whatever. Real value there.
Didn't work for me. I had a package marked delivered that never showed. The AI initiated a return process (but I didn't have anything to return). I needed to escalate to a human.
Agreed; they're far better than the old style robots, which is what you'd have to deal with otherwise.
More generally, when done well, RAG is really great. I was recently trying out a new bookkeeping software (manager.io), and really appreciated the chatbot they've added to their website. Basically, instead of digging through the documentation and forums to try to find answers to questions, I can just ask. It's great.
What could the LLM be doing that wasn't possible inside the app? At the end of the day, the LLM is just making an API call to whatever system needed to be updated anyway, that could have just been a button in an app.
Just to be clear, the LLM assistant could be a great supplement to the app for people with disabilities or those who struggle with phone apps for whatever reason, but for most people the LLM phone call seems worse.
There's plenty of time for me inside the Amazon app where I'll click the button to get a refund or replacement on an order and go through the little radio options wizard to select the reasoning, and it will tell me it's not eligible for a refund in the end.
I'll switch to the AI chat where it lets you select your order and I'll do the same thing, and it has no issue telling me it can give me a refund and process it instantly.
So my case, the two seem to behave differently. And these are on items that say they're eligible for refunds to begin with when you first order them.
I think the point is: If there is an API somewhere in Company's systems that does what the customer wants, why have a phone tree or an LLM in the way? Just add a button to the app itself that calls that API.
i genuinely don't get the point of this. isn't it easier to have a native chat interface? phone is a much worse UX and we simply use it because of the assumption that a human is behind it. once that assumption doesn't hold - phone based help has no place here.
>and he’s losing thousands of dollars per month because he misses hundreds of calls per week. He’s under the hood all day. The phone rings, he can’t answer, the customer hangs up and calls someone else. That’s a lost job — sometimes a $450 brake service, sometimes a $2,000 engine repair — just gone because no one picked up.
How much does it cost to have an outsourced receptionist? Even if it is 500 a month if we are really talking about thousands of dollars per month lost your ROI is still crazy.
Never mind an outsourced receptionist, some of those calls could be handled simply by the mailbox. Of course, some people will hang up once the mailbox message starts - but then again, some will also hang up once they realize they're talking to an AI chatbot, so...
This is the critical data —» how many people hang up on the AI chatbot vs how many people hang up on the voice message prompt.
If it is even close, well, the AI needs to be improved.
If the AI is way ahead, but still loses/drops more than a live receptionist (outsourced or in-house), the AI either needs improvement, or to be dumped for a live receptionist, and that's kind of a spreadsheet problem (how many jobs lost in each case, vs costs).
I think the question of lost opportunities versus costs is the best thing to look at here. You could pay a receptionist like 50-60k a year but they have to bring in the work. Maybe the AI dumps a percentage over a real receptionist but they still bring in more than the mailbox. But there's a cost to the AI too.
But the real question you should also ask is what else can that human do for you that the AI can't because they have eyes and ears and hands?
The question is more why employ a full time receptionist when fractional services are available and it’s an old well established industry. A couple hundred dollar a month could employ a human only when the phone rings and to schedule their visit plus any FAQ. I’m sure Ruby.com already has plenty of auto shop customers.
He will still need a pipeline of work to keep himself and his team busy. Someone has to do that job, if clients are self selecting then it makes sense to automate it if possible.
I have a friend who runs a trade with an outsourced reception - they employ 3 full time people and the reception is about £150/mo for 9-5 manning of calls. He does the scheduling in the evenings.
If we take OP’s post at face value, presumably his brother is already at 100% capacity otherwise he wouldn’t be missing all these calls.
No - £150/mo for the service. I asked him and he said they take the calls, write up notes and he handles the callbacks/etc himself.
I don’t know if he’s “tested”, but he said he’s happy enough with the service. We don’t always have to AB test every possible option - sometimes good enough is good enough.
Well presumably 150 is what you pay to use the service, and they have like 100+ companies using it.
The model is exactly like Planet Fitness or similar gyms: It doesn't work if everyone visits at once, but you plan on most people using it once a week.
Yeah exactly - I don’t know how many calls he gets but it’s less than an amount to employ a full time person, but more than enough that it’s worth having someone to pick up a phone he can stay on the job.
A good 'Service Writer' (the term you use for this job) isn't cheap and typically aren't outsourced. Usually because your (local) competition is going to be using them too. And also because customers aren't going to trust a person that is writing service for multiple shops.
That said, a good service writer is worth their weight in gold. Also, they are typically going to be the person you end up selling the business to when you retire. Most mechanics aren't good enough at the business side of things to actually buy, but service writers are.
Generally, by whether they know what’s going on at the shop. Usually if I’m calling on the phone, it’s for a specific answer that’s not gettable through a computer.
“Hey can you look out and see if Joe’s almost done with the blue Chrysler?” is an easy ask for the phone answerer at my local Joe’s shop (it’s his wife, and as a bonus she’ll also holler at him or his crew to hurry up because @alwa is waiting on it).
Contrast with the grant-funded pharmacy I use. Some management type suggested they could deal with their insane level of overwork by automating away the phones to a hostile and labyrinthine network of IVRs. Oh, it has “AI,” but only to force choices between forks in decisions trees corresponding to questions I didn’t have—and every path still eventually ends in “this voice mailbox is full, goodbye.”
After literal hours of my life trying to wrestle their IVRs into helping—I do sympathize with their workload and don’t want to be a special snowflake—I now drive 30 minutes to ask questions face to face.
In general I’ve maxed out what’s discoverable by automated means before I call. So a call center is both useless and insulting.
Responsible (directly or indirectly) for quite a few of them, mostly oldish and wheezy; I’m not myself mechanical; and we use the shop mostly for routine maintenance—rotate the tires every few thousand miles, swap the brake pads, deal with the oil changes/fluids/filters, etc.
Partly as a preventative measure: we trust them. In the rare cases when they find something, it’s real. As a consequence we get ahead of brewing problems.
Plus loyalty, to some extent; we try to throw work their way when we can, even if we probably could handle it ourselves. The relationship between our families goes back a good 60 years by now.
Fully grant that my situation is unlikely to be representative. And no shade toward OP—it sounds like a cool project thoughtfully done, and a real improvement over the status quo for her relative!
Would redirecting them to a website where they can go through a guided intake and get some confirmation of a callback? A well designed UI that allows them to ID their vehicle (make/model/year) and the issue they're having? HTML5 has decent speech to text out of the gate and they can just talk it out.
Obviously that process could happen purely via voice but I think there's not as much love for walking through forms in a phone call.
If Joe has a PC in the shop with a tailored UI, he could get pings of pending requests and when he comes up for air, update the intake (via voice to minimize greasy hands) and initiate a call back then and there?
Related - Monro Muffler Brake apparently switched to an offshore call center model to handle scheduling every single auto shop location. I hear nothing but complaints from both fellow customers as well as the shop managers themselves about their local phone number being ripped away from them and handed off to a call center to try to schedule vehicle service from thousands of miles away.
Yes - but as others have mentioned you wouldn't have good advertising material for your AI "readiness" courses.
More to the point - does this garage even have the time and space to service more vehicles? Generating a bunch of new low-value/low-loyalty customers takes up time and space and might have a lower return-per-hour while making it harder to retain higher value returning customers.
Additionally, as "luxury mechanic" (apparently specializing in BMW but servicing other makes) you'll need to appeal to "luxury drivers" and bolting on more crap that makes the experience worse is probably not the way to do that.
I build software for contractors (plumbers, electricians, HVAC repair, etc) and they're some of the fastest adopters of these systems. I believe YC has even invested in a few.
Regarding the AI receptionists, from the calls I've listened to, there's still a bit of the uncanny valley/overlapping speech issues that I'm unsure are ever fixable just due to latency.
But for low margin businesses like contracting and (I imagine) auto repair where labor is your most expensive cost, these owners are doing anything they can to reduce their overhead.
I recently fired a plumber I was trying to contract for a five figure remodel job because his AI receptionist couldn't understand my address and therefore could not schedule the appointment. After that experience, I will not use a contractor that I cannot personally get in touch with until these systems improve demonstrably.
If you didn't have a sibling to do it for you free/cheap, I wonder how many months of a human receptionist (or service) the fee to build (and maintain) such a thing would cover.
Is RAG even necessary here? Minimal information like a couple of price list with job times and opening hours should easily fit into any context window, right? It's not like he's dumping entire service manuals into the vector database here...
It most likely isn't, but it seems like this project was more for learning purposes than for anything else. In that case, why not go for the "production-ready", "highly scalable" solution? I sometimes do the same for my personal projects. I over-architect them not because it's necessary, but because I want to get my hands dirty and learn something new.
I think the biggest thing is to not give it access to anything like a shell (obviously), limit the call length, and give it a hangup command.
Then you tell it to just not answer off the wall questions etc. and if you are using a good model it will resist casual attempts.
I don't see being able to ask nonsense questions as being a big deal for an average small business. But you could put a guardrail model in front to make it a lot harder if it was worth it.
in general these types of attacks are still difficult to solve, because there are a lot of different ways they can be formulated. llm based security is still and unknown, but mostly i have seen people using intermediary steps to parse question intent and return canned responses if the question seems outside the intended modality.
At the moment I'm pretty inclined to hang up if I feel I'm wasting my time with a robot.
But maybe soon we will not even realise we speak to a robot, given the current speed of ai development.
I wonder how that will erode trust in calls. I moved from cold emailing and cold LinkedIn to cold calling because of the massive amounts of ai spam I have to compete with. But maybe cold calling will die soon as well if the robots emerge.
I've been car shopping recently, and it took me a full week to realize that every dealership I'd talked to had an LLM with a fake name handling customer intake. I was 4 emails deep with one before I stopped to think about how plausible their near-instant response times were.
I assume people are pissed off because its building something that people already hate and its a fully AI generated post that is jarring to read.
Nothing pisses people off faster than calling up and getting put on the line with a robot. Like if we're thinking about this problem and how to solve it we can look at other examples like a website with a booking form,call the mechanics cell directly, hire a receptionist or worst case outsource the receptionist to a booking agency.
The alternative here isn't talking to a person. The alternative is leaving a voicemail and praying for a callback. Likely, you don't even leave a voicemail and a match is not made.
Asking a business to hire a receptionist is probably a bit unlikely for small businesses in today's environment.
Voicemail alone doesn't have information about someone's schedule.
"I'd like to schedule a smog check tomorrow or Wednesday?" rather than leaving a message and hoping for a callback that you don't miss either (and have go to voice mail).
Consistency of interface. I've got the phone number of the mechanic I go to in my phone's address book... and the various medical services for appointments there.
If they were to have an app on their website, I wouldn't know because I don't use the webpage for that purpose - I call them.
Now, they've all got receptionists there that work full time and handle the appointments and take that first tier of service. These are larger places that have two receptionists working the full day (handling walkins, calling confirmations, and the other administrative tasks)... I don't think that an LLM (even with access to appointments) would do a better job than what they do (and certainly wouldn't be able to do the "ok, I showed up, now what do I do?")
However, I could see this for a small mechanic shop. When I lived in California, I went to what is now Shoreline Auto Care on El Camino and Shoreline - a small two bay mechanic... and that's not the type of place that has the business that can afford a full time receptionist.
So the question for a place like that... "what do you get for the phone calls you miss?"
That makes sense, but maybe the UX problem runs a bit deeper. Maybe contacts apps should surface websites higher in the UI for saved businesses?
Running a small website with a calendar booking link just sounds much easier, cheaper, less error prone, and a better UX than running a voice LLM that is connected to a RAG and calendar. And I still don't think the technology around us has been built to support small websites or small businesses.
You're probably correct in that (edit: re-reading this... no, I'm not an AI - some people write this way and I tend to prefer to defuse potential arguments where two people are arguing for the same thing in a thread). Though I would think of the voice LLM system more as a smart answering machine rather than a complete replacement of calling the shop. The normal (preferred) course would be for one of the human staff there to pick up the phone... say before the 5th ring. On the 7th ring, it goes to voicemail... or to the voice LLM augmented voicemail.
If the LLM augmented voicemail is not much more than the business voicemail service that such places have now, is it enough value add?
That also implies other things - such as the capability to integrate with the calendar and appointment system which I'm still in the very hesitant side, but it could be an interesting service add on if it was properly limited.
If they can make the AI ajudicate the knowledge of the caller, I'm more than all for it.
"Hmm, this user seems to really understand network topology, better get him over to engineering"
vs.
"Hmm, the user doesn't know the difference between their router and their modem, I should help them identify the router then walk them through a power cycle".
The poster has built something that, while technically interesting, is profoundly annoying as a user and deserves to be backlashed to prevent more of this kind of stuff to be built
Just how much effort even went into this? The project is LLM generated, the blog post is LLM generated. It produced something that is really annoying to deal with as a consumer. The last thing I want to talk with when calling a boutique garage is some AI receptionist.
It's also just easier. When I needed a service for my car (and I didn't already have an established shop where to take it), I just wrote what I was looking for once and emailed the same thing to multiple different places at once.
If I had to call four different places and spend five minutes on the phone with each shop, that'd eat up my entire lunch time.
Think about scaling this as you're building, your brother is just your first customer, make sure your service works with any number of customers out of the gate. I should be able to sign up for your service, point it at my website to ingest all my information, and have it ready to go.
Thanks for sharing the journey. What did you do in terms of security for the receptionist? I suspect someone can trick the agent through things like prompt injection.
No idea what `luxury` is doing here, but if I get an LLM receptionist, that ain't it.
This isn't to disparage the project - I think this sort of usage will become very common and a decent standard that produces good consumer surplus in terms of reduced costs etc. Especially impressive is that it's a DIY family-first implementation that seems to be working. It's great hacker work.
But be warned it will erode - in general - the luxury previously associated with your brand, and also turn some customers away entirely.
In America the normal term is "European", not "luxury".
It would be somewhat odd to specialize in both American and European luxury cars. It'd be significantly less odd to service a RR and a BMW 3er next to each other.
The actual company’s website says European, not luxury. My guess is that the OP wasn’t familiar with this distinction and just figured luxury means the same thing (the car shop is his brother’s as per the link.)
Admittedly I missed this distinction, but does the point still stand?
A BMW owner has fussier standards (on average) than a Toyota owner. The 'higher touch' a service you're trying to provide, the less welcome these interventions will be. If there's a distinction between a normal-car garage and a luxury-car garage, this probably comes down to some sort of licensing or certification from those luxury brands. Seems plausible to me that luxury brand X could stipulate things like availability of human contact points.
Re: not being a car mechanic, it's true, but I'll have you know that I replaced my own blower motor a few months ago :)
This isn’t accurate. Lots of types of people own older used European/luxury cars, it’s not just a rich people thing. Used BMWs especially aren’t that expensive compared to new cars.
This garage is for those older cars and has no connection to the actual manufacturers, so there is no licensing required.
Sure and just to add a funny anecdote here: a family member of mine used to own a 1980s Jaguar. Beautiful car and he probably paid $5,000 for it, but it had issues pretty much every month. His reasoning for keeping it was that the monthly repair costs were roughly equivalent to what a new car payment would be.
I agree with you on the dealership dynamics though.
> No idea what `luxury` is doing here, but if I get an LLM receptionist, that ain't it.
Bingo.
You can't get away with AI slop in a service oriented for wealthy customers.
The day my dealership starts answering me with AI they lose a customer 100%.
This solution screams "built by a tech bro with no idea about economics and marketing" which is the VC playbook into modernizing (and failing) businesses they don't understand.
You are right, but this also isn't a luxury mechanic shop. A luxury mechanic shop would be a place that services and customizes Bentleys, RRs, vintage Ferraris and similar. And to your point, the clientele there will be extremely unimpressed if they are asked to speak with an AI. A place like that is as much about being pampered by staff as about the workmanship.
OP's brother is by all accounts running a successful boutique workshop, but the various luxury annotations were completely unnecessary and just detract from the actual project. If they do want to lean into the luxury segment, being cheap with AI receptionists is not the way to go. They need to hire actual staff who has experience with HNW individuals.
Honestly great work, but this is very much something where the results matter more than the product. It ends without a single comment about whether it worked in Production.
How are they measuring the success rate? It seems like a project like this is a great time to dive into the problem and define the parameters of success. If only to inform how you design the ai’s presentation of the shop. Ie. how quickly does it get customer’s profile and discover their issue.
Thinking about my experiences with mechanics shops—with the exception of dealerships and larger operations—if you’re talking to a principal, the conversation is brief. It’s possible customers will respond positively if the bot is effective for scheduling and if the price communicated by phone, and the final price are somehow aligned to expectations.
If it’s anything like talking to ChatGPT via voice they’d definitely notice. And if it has anything like the failure modes it does, the OP’s brother is going to eat into a lot of the cost savings he’d get (vs using a human receptionist or even an outsourced receptionist) dealing with fires like the AI said my car would absolutely be done today.
Ignore the expected negativity, many here have not used the latest gen of voice agents in development. Even if used as a router , prefer that to waiting to get through
I was agreeing with all the nay-saying comments, but yours made me see the idea as good. I guess the word "luxury" ruined it for OP.
But a speech-to-text and text-to-speech system that I know is "understanding" me would be great rather than waiting music. The shop could even sell it as "As a small shop, most of our employees are busy fixing cars, so we are using AI to help with calls" (Although then people who are anxious about AI stealing jobs might hang up). The robot can ask me what I need, and then say "So for [this service], the price would be..." (to tell the caller what it has understood).
If the AI can even look at gaps in the shop's schedule and set an appointment time, the customer might even be happy that they just spent a minute on the phone instead of 10+...
I would rather just be sent to a regular old answering machine. Dealing with an AI is dehumanizing. In almost every single case where I actually need to call a place, its because I need to talk to them about something an automated system like booking an appointment, can't handle.
I know it's intended to be dismissive, but I would appreciate the choice.
Even if the new model that came out last week totally fixed all the problems this time for real, most people's experience with chatbots is that they are prone to misunderstanding or making false statements. "Hallucinations"
I have yet to experience any degree of confidence in any output from an LLM, so I'd rather leave the message. I don't know how common this point of view is.
brutal market for lemons: the last 100 times they heard robovoice on the phone they had a terrible experience, and any money you spend fixing this is wasted because the customer cant tell your robovoice is actually honest and capable of making commitments because they all sound perfectly confident and correct even the ones who know nothing and will promise anything
The responses here remind me how much of a bubble we are in on HN. "I hang up when I realize I am talking to a bot", "I would rather email". I think a lot of non-tech-savvy people would rather not send an email or realize they are talking to a bot.
I think we can solve this as a society by just making it clear that if you put an AI between you and your customers, you are absolutely bound by anything it offers them.
> Wired up Claude for response generation — The retrieved documents get passed as context to Anthropic Claude (claude-sonnet-4-6) along with a strict system prompt: answer only from the knowledge base, keep responses short and conversational, and if you don’t know — say so and offer to take a message. No hallucinations allowed.
Claude will hallucinate anyway, sometimes.
I don't think there's any way around this other than a cli or MCP that says "press the 'play prerecorded .WAV file button that says the brake repair service info and prices.'"
That made me laugh a bit as well. Definitely want to see some rigorous testing on that, I'd expect that on longer calls tha caller can make the ai say basically anything.
I admire what you have done, but for a luxury experience, I do not want to talk to an AI that just tells me what is already on the website. If I have gotten to the point where I am calling you, its because I couldn’t find an answer to my question on the website in the first place.
For example, even if it shows a boost of $100,000 per month in revenue. It could likely have been achieved with a shared virtual assistant / receptionist for about $200-1000 per month (depending on exactly call volumes).
So really, the revenue was already lost and going forward you’re just deciding to capture it. You've created a more complicated mouse trap than what was already available to you. The difference is saving a couple hundred dollars of labor less whatever your AI/tech costs are. I’d still go the human route because it’s more future proof and if this is a luxury service, human service is always going to feel more luxurious.
> He’s under the hood all day. The phone rings, he can’t answer, the customer hangs up and calls someone else
the mechanic is already very busy in the first place so unless he plans on expanding shop the whole thing is a waste of time
I know it's not that simple, but my gut says theres value to at least hearing out the people taking action to call you. Especially if that's automated and low cost to you.
So we cannot always assume that the business owner (especially the solo mom and pops) wants more business. Good ones are already very busy.
I guess as a plumber having enough of the type of jobs that can wait a week that you can turn away the urgent calls might be one of those feature-not-a-bug type situations.
Not everyone works all three or wants to do more than one of these groups. There’s different levels of demand, pay, competition at each.
If I were already an existing customer and just wanted to schedule an oil change, it'd be fine, though I'd probably just schedule on the website anyway. I'm really only going to call in if I have an unusual circumstance and actually need to speak with someone.
If you only have 4 options, just give me the old school list of voice options and I'll press 1 through 4, in less time, and being only moderately annoyed.
But a knowledgeable AI system as described in the article - that knows what it knows and tells you when it doesn't - could work great. If it had access to inventory and calendar, it might have worked for you. The question is whether the implementation lives up to the high expectations set by the articles.
In fact, decision trees are nice because they tell your more or less up front what they're capable of.
What really sucks (AI or decision tree, either way) is when they don't let you easily speak with someone.
"Hi, I'm the LargeBank AI Assistant. How can I help you?" "I'd like to know the balance of my checking account."
And then authenticate and get the balance as usual. Simpler and faster. Agreed that it becomes a problem if it's seen as a replacement for human agents though. In an ideal world it would actually free up the human agents for when they're actually needed. In reality it'll probably be some of each.
I was on the wrong end of some (presumably) LLM powered support via ebay's chatbot earlier this week and it was a completely terrible experience. But that's because ebay haven't done a very good job, not because the idea of LLM-powered support is fundamentally flawed.
When implemented well it can work great.
Unfortunately, the human behind it was not technically-savvy enough to clarify a point, so I had to either accept the LLM response, or quit trying. But at least it saved me the time from trying to explain to a level 1 support person that I knew exactly what I was asking about.
More generally, when done well, RAG is really great. I was recently trying out a new bookkeeping software (manager.io), and really appreciated the chatbot they've added to their website. Basically, instead of digging through the documentation and forums to try to find answers to questions, I can just ask. It's great.
If my mechanic answered with an LLM I’d take my car elsewhere.
Just to be clear, the LLM assistant could be a great supplement to the app for people with disabilities or those who struggle with phone apps for whatever reason, but for most people the LLM phone call seems worse.
I'll switch to the AI chat where it lets you select your order and I'll do the same thing, and it has no issue telling me it can give me a refund and process it instantly.
So my case, the two seem to behave differently. And these are on items that say they're eligible for refunds to begin with when you first order them.
in that medium, llms are so much better than old phonetrees and waiting on hold
Spoken word is still the most information dense way for humans to communicate abstract ideas in real time.
Reading > Listening
Speaking > Typing
If you want raw performance on both sides, It is better to dictate an email that gets read later.
How much does it cost to have an outsourced receptionist? Even if it is 500 a month if we are really talking about thousands of dollars per month lost your ROI is still crazy.
I think most folks already wouldn't be able to tell, with the modern TTS.
It's like AI photos, they fool you unless you're looking for it.
This is the critical data —» how many people hang up on the AI chatbot vs how many people hang up on the voice message prompt.
If it is even close, well, the AI needs to be improved.
If the AI is way ahead, but still loses/drops more than a live receptionist (outsourced or in-house), the AI either needs improvement, or to be dumped for a live receptionist, and that's kind of a spreadsheet problem (how many jobs lost in each case, vs costs).
But the real question you should also ask is what else can that human do for you that the AI can't because they have eyes and ears and hands?
If we take OP’s post at face value, presumably his brother is already at 100% capacity otherwise he wouldn’t be missing all these calls.
I don’t know if he’s “tested”, but he said he’s happy enough with the service. We don’t always have to AB test every possible option - sometimes good enough is good enough.
The model is exactly like Planet Fitness or similar gyms: It doesn't work if everyone visits at once, but you plan on most people using it once a week.
That said, a good service writer is worth their weight in gold. Also, they are typically going to be the person you end up selling the business to when you retire. Most mechanics aren't good enough at the business side of things to actually buy, but service writers are.
“Hey can you look out and see if Joe’s almost done with the blue Chrysler?” is an easy ask for the phone answerer at my local Joe’s shop (it’s his wife, and as a bonus she’ll also holler at him or his crew to hurry up because @alwa is waiting on it).
Contrast with the grant-funded pharmacy I use. Some management type suggested they could deal with their insane level of overwork by automating away the phones to a hostile and labyrinthine network of IVRs. Oh, it has “AI,” but only to force choices between forks in decisions trees corresponding to questions I didn’t have—and every path still eventually ends in “this voice mailbox is full, goodbye.”
After literal hours of my life trying to wrestle their IVRs into helping—I do sympathize with their workload and don’t want to be a special snowflake—I now drive 30 minutes to ask questions face to face.
In general I’ve maxed out what’s discoverable by automated means before I call. So a call center is both useless and insulting.
Partly as a preventative measure: we trust them. In the rare cases when they find something, it’s real. As a consequence we get ahead of brewing problems.
Plus loyalty, to some extent; we try to throw work their way when we can, even if we probably could handle it ourselves. The relationship between our families goes back a good 60 years by now.
Fully grant that my situation is unlikely to be representative. And no shade toward OP—it sounds like a cool project thoughtfully done, and a real improvement over the status quo for her relative!
Obviously that process could happen purely via voice but I think there's not as much love for walking through forms in a phone call.
If Joe has a PC in the shop with a tailored UI, he could get pings of pending requests and when he comes up for air, update the intake (via voice to minimize greasy hands) and initiate a call back then and there?
Christ just hire some local teenager or whomever. There's people who will work for minimum wage.
More to the point - does this garage even have the time and space to service more vehicles? Generating a bunch of new low-value/low-loyalty customers takes up time and space and might have a lower return-per-hour while making it harder to retain higher value returning customers.
Additionally, as "luxury mechanic" (apparently specializing in BMW but servicing other makes) you'll need to appeal to "luxury drivers" and bolting on more crap that makes the experience worse is probably not the way to do that.
Regarding the AI receptionists, from the calls I've listened to, there's still a bit of the uncanny valley/overlapping speech issues that I'm unsure are ever fixable just due to latency.
But for low margin businesses like contracting and (I imagine) auto repair where labor is your most expensive cost, these owners are doing anything they can to reduce their overhead.
Here’s the video: https://youtu.be/QmH9b27xm6k
It was very impressive at that time. They did raise money after that pitch, but they ended up pivoting (multiple times). They IPOd in 2017
Then you tell it to just not answer off the wall questions etc. and if you are using a good model it will resist casual attempts.
I don't see being able to ask nonsense questions as being a big deal for an average small business. But you could put a guardrail model in front to make it a lot harder if it was worth it.
* i'd love to hear a sample/customer call. Even if it's just a test
* a blog without rss? How can i subscribe for part 2?
But maybe soon we will not even realise we speak to a robot, given the current speed of ai development.
I wonder how that will erode trust in calls. I moved from cold emailing and cold LinkedIn to cold calling because of the massive amounts of ai spam I have to compete with. But maybe cold calling will die soon as well if the robots emerge.
I appreciated your post and have some takeaways around text formatting for TTS in my own projects. Thanks!
Nothing pisses people off faster than calling up and getting put on the line with a robot. Like if we're thinking about this problem and how to solve it we can look at other examples like a website with a booking form,call the mechanics cell directly, hire a receptionist or worst case outsource the receptionist to a booking agency.
Asking a business to hire a receptionist is probably a bit unlikely for small businesses in today's environment.
"I'd like to schedule a smog check tomorrow or Wednesday?" rather than leaving a message and hoping for a callback that you don't miss either (and have go to voice mail).
Being able to have a voice appointment scheduling system (assuming that it isn't being jail broken https://www.youtube.com/shorts/GJVSDjRXVoo ) could be useful... though there are problems with giving it agency over decisions ( https://www.bbc.com/travel/article/20240222-air-canada-chatb... ).
If they were to have an app on their website, I wouldn't know because I don't use the webpage for that purpose - I call them.
Now, they've all got receptionists there that work full time and handle the appointments and take that first tier of service. These are larger places that have two receptionists working the full day (handling walkins, calling confirmations, and the other administrative tasks)... I don't think that an LLM (even with access to appointments) would do a better job than what they do (and certainly wouldn't be able to do the "ok, I showed up, now what do I do?")
However, I could see this for a small mechanic shop. When I lived in California, I went to what is now Shoreline Auto Care on El Camino and Shoreline - a small two bay mechanic... and that's not the type of place that has the business that can afford a full time receptionist.
So the question for a place like that... "what do you get for the phone calls you miss?"
Running a small website with a calendar booking link just sounds much easier, cheaper, less error prone, and a better UX than running a voice LLM that is connected to a RAG and calendar. And I still don't think the technology around us has been built to support small websites or small businesses.
If the LLM augmented voicemail is not much more than the business voicemail service that such places have now, is it enough value add?
That also implies other things - such as the capability to integrate with the calendar and appointment system which I'm still in the very hesitant side, but it could be an interesting service add on if it was properly limited.
"Hmm, this user seems to really understand network topology, better get him over to engineering"
vs.
"Hmm, the user doesn't know the difference between their router and their modem, I should help them identify the router then walk them through a power cycle".
Why should people be impressed by this?
If I had to call four different places and spend five minutes on the phone with each shop, that'd eat up my entire lunch time.
This isn't to disparage the project - I think this sort of usage will become very common and a decent standard that produces good consumer surplus in terms of reduced costs etc. Especially impressive is that it's a DIY family-first implementation that seems to be working. It's great hacker work.
But be warned it will erode - in general - the luxury previously associated with your brand, and also turn some customers away entirely.
I assume the Op, being a programmer and not a car mechanic, just assumed they mean the same thing.
The entire discussion here about how AI undercuts luxury brands has absolutely nothing to do with the actual post.
It would be somewhat odd to specialize in both American and European luxury cars. It'd be significantly less odd to service a RR and a BMW 3er next to each other.
A BMW owner has fussier standards (on average) than a Toyota owner. The 'higher touch' a service you're trying to provide, the less welcome these interventions will be. If there's a distinction between a normal-car garage and a luxury-car garage, this probably comes down to some sort of licensing or certification from those luxury brands. Seems plausible to me that luxury brand X could stipulate things like availability of human contact points.
Re: not being a car mechanic, it's true, but I'll have you know that I replaced my own blower motor a few months ago :)
This garage is for those older cars and has no connection to the actual manufacturers, so there is no licensing required.
I agree with you on the dealership dynamics though.
Jaguar-of-Theseus
Bingo.
You can't get away with AI slop in a service oriented for wealthy customers.
The day my dealership starts answering me with AI they lose a customer 100%.
This solution screams "built by a tech bro with no idea about economics and marketing" which is the VC playbook into modernizing (and failing) businesses they don't understand.
OP's brother is by all accounts running a successful boutique workshop, but the various luxury annotations were completely unnecessary and just detract from the actual project. If they do want to lean into the luxury segment, being cheap with AI receptionists is not the way to go. They need to hire actual staff who has experience with HNW individuals.
Would love to see benchmarks on Mac Studio with its 7.4 GB/s SSD bandwidth — feels like the sweet spot for this technique.
How are they measuring the success rate? It seems like a project like this is a great time to dive into the problem and define the parameters of success. If only to inform how you design the ai’s presentation of the shop. Ie. how quickly does it get customer’s profile and discover their issue.
Thinking about my experiences with mechanics shops—with the exception of dealerships and larger operations—if you’re talking to a principal, the conversation is brief. It’s possible customers will respond positively if the bot is effective for scheduling and if the price communicated by phone, and the final price are somehow aligned to expectations.
However, does the regular "joe/jane" feel the same way? I imagine my mom or dad would most likely not notice or care if they did.
But a speech-to-text and text-to-speech system that I know is "understanding" me would be great rather than waiting music. The shop could even sell it as "As a small shop, most of our employees are busy fixing cars, so we are using AI to help with calls" (Although then people who are anxious about AI stealing jobs might hang up). The robot can ask me what I need, and then say "So for [this service], the price would be..." (to tell the caller what it has understood).
If the AI can even look at gaps in the shop's schedule and set an appointment time, the customer might even be happy that they just spent a minute on the phone instead of 10+...
A friend of mine worked for a call center that did car rentals, old people would call them and ask to rent a car.
Maybe the AI system should have "Press 1 to talk to AI, press 2 to leave a message" so experts like you can press 2.
Even if the new model that came out last week totally fixed all the problems this time for real, most people's experience with chatbots is that they are prone to misunderstanding or making false statements. "Hallucinations"
I have yet to experience any degree of confidence in any output from an LLM, so I'd rather leave the message. I don't know how common this point of view is.
Claude will hallucinate anyway, sometimes.
I don't think there's any way around this other than a cli or MCP that says "press the 'play prerecorded .WAV file button that says the brake repair service info and prices.'"
Like CMON this is the bare minimum here.