It is almost been a year since Microsoft pulled this plug, but the problem it forced me to think through has not gone stale. If anything it has gotten more relevant, because every deprecation notice right now points you toward the same fix: swap it for an LLM based service. Sometimes that is genuinely the right call. A lot of the time it is just what is shipping this quarter, and the cost bill shows up three months later when nobody is looking.
Microsoft retired the Bing Search APIs, including Spell Check, on August 11, 2025. If you were running a bot service that cleans user input before it hits your NLU layer, this probably broke something for you.
This is how we actually thought through the replacement. Not because our solution is the correct one, it has real tradeoffs I am not going to hide, but because the decision making process matters more than any specific stack. If you are a senior engineer being handed just replace the spell checker as a one line ticket, this is for you: it was never really a which library problem, it was a business constraints problem wearing a technical costume.
User messages went through a spell check pass before hitting the bot service. Small thing, big impact. Something like helo, I want too chnage my adress needed to become clean enough for the NLU model to classify with confidence. Bing Spell Check handled this well and we never had to think about it. Then Microsoft pulled the plug. We disabled the call rather than fail silently, intent recognition accuracy dropped, and someone above me said to fix it.
Microsoft's own retirement notice points to Grounding with Bing Search, part of Azure AI Agent Service. It is built for RAG scenarios: an LLM decides whether to search, pulls live web data, generates a cited response. It is not a spell checker, it is a full search and answer pipeline that happens to be the migration path Microsoft wants.
If your actual need is take a string, fix typos, return it fast, this is architecturally the wrong tool.
The LLM option is not wrong because LLMs are bad at spelling, GPT class models are decent at contextual correction. It is wrong because the economics do not match a task that runs on every message, forever.
Spell checking is not one technique, it is a spectrum, and where you land should be driven by business constraints, not by what is trending that week.

None of these is objectively best. Four points on a cost, latency, and accuracy curve, and the right one depends entirely on what you are building.
The point here is not which language or database we used, it is that the shape is portable across any locale if you design it right.
Swap the dictionary for a different locale, retrain the context model on that language's corpus, and the orchestration does not change. That portability was deliberate, once we realized we might need more than one market. I am deliberately not detailing the exact technique in that scoring layer, because the point of this post was never a blueprint to copy, it is the decision making underneath it.

This is not the best architecture in some absolute sense, it is the best fit for high volume, a tight latency budget, a bias against confidently wrong corrections, and a cost ceiling that ruled out per request inference. Change any of those constraints and the right answer on the spectrum moves. That is the actual thesis: the constraints choose the tool.
When a vendor deprecates a utility and points you at their LLM based replacement, do not take it at face value just because it is the easy path. Ask what actually determines cost and correctness at your scale, not the demo's scale.
Everyone moving to LLM based solutions is a trend, not an architecture decision. The teams that get burned swap a component built for one purpose for a general purpose reasoning engine because it was first in the migration guide. The teams that get it right treat cost, latency, and error tolerance as top priority constraints from day one, and pick the point on the spectrum that actually matches the job.
Bing Spell Check being deprecated was not the hard problem. Figuring out what we actually needed a spell checker to do, at what cost and accuracy bar, was.
Subscribe to get posts like this straight to your inbox - no noise, just quality content.