The 8% of startup names AI hasn't decided about yet
We probed 599 startup domain names with four large language models. Most got confident, unanimous, often-correct industry guesses — names doing semantic work, names where the LLMs and a literate human would agree about category. About 8% didn't. They cluster into five distinct patterns. For builders, those patterns are the highest-leverage band of names you can pick from today.
Why this matters
For founders building today, AI is not just where customers research products — it's also where they form first impressions. When someone asks ChatGPT, Claude, or Perplexity "what kind of company is acme.ai?", the answer is almost always confident, often based on the name alone, and very hard to redirect once cached.
For 92% of names that means the LLMs have already decided what your company is. For the 8% in this tail, that decision hasn't been made — which means a deliberate brand strategy can shape it.
Per namedesk's intent thesis: high LLM recall is good for monetizers (the brand has done work). Low recall is good for builders (you have room to do work). Names matching one of the five patterns below sit in the latter camp.
The five patterns
A compound word where one root is a software-engineering term ("greenfield", "fork", "branch") AND a real-estate or land-use term. The LLMs split based on which industry vocabulary the model has seen the root used in more often.
A founder picking a name with this shape gets to choose which vocabulary they reinforce in their copy and SEO. Whichever lane the company occupies, the OTHER reading evaporates within months — a cheap source of brand-narrative control.
A real English word that evokes a vibe but no specific category. The LLMs reach for the cultural gravity of the word (entertainment, media) but a substantial minority pivot to "tech" because the .com TLD reads enterprise to them.
High brand-design freedom. The name does not pre-commit you to a sector — the visual identity, copy, and product define what the word means in your context. Risk: a competitor could ship the same name in the OTHER lane and own the alternative reading globally.
Pure invented word, no recognizable root. The LLMs default to "must be a startup of some kind" and split between the three commonest categories: a shopping-feeling cluster (e-commerce), a platform-feeling cluster (tech), and a soft-consonant cluster (health/wellness). The split tracks the phonetic feel of the syllables.
Closest thing to a true blank canvas. No semantic baggage to fight. The cost: zero free brand work — every customer impression starts at "I have never heard of this and have no idea what it does." Worth it only if your product strategy includes intentional category-creation.
Suffixes like "-pad" or "-dock" are commercial-real-estate vocabulary AND developer-tool vocabulary. Combined with a generic prefix, the LLMs split based on which vocabulary they've seen the suffix used with more in their training data.
A subset of the two-meaning-compound pattern. Same advice applies — pick a lane, double down in your copy, the other reading goes quiet.
Four-letter abbreviations have no semantic content for the LLM to work with. The model defaults to the three most common abbreviation-resolution categories in its training data and splits.
Maximum brand freedom and maximum required brand work. The name carries zero semantic load — every association is one you build deliberately. Best paired with a category-defining product strategy and a meaningful budget for early brand impressions.
The hard part: finding your own
Reading patterns is easy. Finding a name in this band that is also available, pronounceable, and aligned with your product strategy is hard. Traditional naming heuristics — pronounceability, length, .com availability — don't tell you whether the LLMs have already made a decision about the name.
The four-LLM probe namedesk runs on every name takes about ten seconds and surfaces the same data we used to identify these patterns. If the inference and coherence scores both come back high, the name has semantic weight — useful when you want a name that does brand-work for you, costly when you'd rather define the brand yourself. If the scores come back split (this band), you have a blank-canvas name and the responsibility that comes with it.
For the inverse case — how AI invents meaning even for nonsense names — see our companion piece below.