Bland AI

AI Agents AI Companies Conversational AI Voice AI

29 min read

Updated Jun 7, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 7, 2026

Fact-checked

In review queue

Sources

28 citations

Revision

v3 · 5,763 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

Bland AI is an enterprise voice AI platform that enables businesses to deploy, manage, and scale AI-powered phone agents for inbound and outbound calling. Founded in 2023 and headquartered in San Francisco, the company builds the full infrastructure stack for AI phone calls, including proprietary speech recognition, language model inference, text-to-speech synthesis, and call orchestration, all hosted on dedicated hardware that Bland operates independently rather than routing through third-party model providers.^[1]^[2] The platform serves industries including financial services, healthcare, insurance, real estate, and logistics, and counts organizations such as Hertz, Better.com, Kin Insurance, the Cleveland Cavaliers, University of Phoenix, and Mutual of Omaha among its enterprise customers.^[3]^[4] As of early 2025, Bland had raised $65 million in total venture funding and claimed to support more than one billion automated calls handled across its customer base.^[3]^[5]

History

Bland AI was founded in 2023 by Isaiah Granet and Sobhan Nejad, both of whom had studied at Washington University in St. Louis before entering Y Combinator's Summer 2023 (S23) batch.^[6]^[7] Granet, who serves as CEO, graduated from WashU in 2022 with a degree in computer science and economics after switching from a marketing and economics track once he saw a Google engineer demonstrate AI-generated advertising. Before co-founding Bland, he had worked as an engineer at Lantern and had previously founded San Diego Chill, a nonprofit that raised over $2.5 million to provide sports programming for children with developmental disabilities.^[8] The nonprofit work shaped Granet's view of organizational building; he later described it as formative in understanding how actions scale to have broader impact.^[8]

Nejad, who serves as co-founder and COO, also participated in Y Combinator's S23 cohort alongside Granet.^[7] He had previously worked as a software engineer at Shogun, a commerce platform, before co-founding Bland.^[9] At the company, Nejad took a central role in building the technical foundations, including Conversational Pathways, Bland's visual programming system for voice agent logic.^[9]

The founding thesis centered on a gap in the enterprise telephony market. While large language models had become capable enough to conduct extended, natural-sounding conversations, no platform existed that combined the latency performance necessary for telephone use, the security requirements of regulated industries, and the programmability needed for complex enterprise workflows. Granet has noted in interviews that the conventional software industry had layered AI tools on top of legacy call infrastructure without rethinking the underlying architecture, creating systems with unpredictable latency and limited controllability.^[2] Bland set out to build the infrastructure from scratch with voice AI as the primary workload, and Granet later recounted that an early prototype wiring a Twilio call into a GPT model returned a roughly five-second latency, which convinced the founders that owning the stack was the only path to a usable product.^[8]

After completing Y Combinator, Bland operated in stealth before publicly announcing a $16 million Series A funding round in August 2024, the announcement that marked its emergence from stealth.^[1]^[10] The round was led by Scale Venture Partners, with participation from Y Combinator and notable angel investors including Max Levchin, co-founder of PayPal; Jeff Lawson, founder of Twilio; and Piotr Dabkowski, CTO of ElevenLabs.^[1]^[10] The angel roster was notable in that it included founders and operators from several of the companies whose infrastructure the enterprise voice AI market had previously depended on.

By early 2025, the company had grown to serve Fortune 500 clients and was handling significant call volume across its customer base. On February 3, 2025, Bland announced a $40 million Series B round led by Emergence Capital, with continued participation from Scale Venture Partners and Y Combinator.^[3]^[4]^[11] The Series B brought Bland's total capital raised to $65 million.^[3]^[4]

Funding

Bland's funding history reflects rapid validation by enterprise investors over a short window.^[11]

The $16 million Series A in August 2024 was Bland's first publicly announced institutional round.^[1]^[10] Scale Venture Partners led the round, and the participation of angel investors with direct roots in telephony infrastructure (Lawson at Twilio), payments infrastructure (Levchin at PayPal), and voice AI synthesis (Dabkowski at ElevenLabs) signaled early cross-sector interest in the company's infrastructure approach.^[10] The total capital raised prior to the Series A, including a pre-seed round through Y Combinator, was $6 million, bringing cumulative funding to $22 million at the time of the Series A announcement.^[1]

The $40 million Series B in February 2025, led by Emergence Capital, reflected the investor thesis that voice is a distinctive and underbuilt layer of enterprise communication software.^[3]^[4] Emergence, whose prior portfolio includes Salesforce, Veeva Systems, and Zoom, published a rationale noting that "voice is the most natural way for humans to connect" while remaining one of the hardest forms of communication to modernize.^[4] The firm pointed to Bland's proprietary infrastructure approach and the team's "unmatched velocity" as key investment drivers, with the round led on Emergence's side by partners Yaz El-Baba and Gordon Ritter.^[4] At the time of the Series B, Bland cited a customer testimonial claiming the platform added more than $40 million in attributed revenue to one enterprise customer, MyPlanAdvocate, within five months of deployment.^[12]

The total $65 million in funding as of early 2025 positions Bland among the better-capitalized independent voice AI platforms, though it remains well below the funding levels of more established contact center AI companies such as PolyAI.^[11]

Architecture and infrastructure

Bland's technical differentiation rests primarily on its decision to own and operate the entire machine learning pipeline rather than routing workloads through third-party model APIs.^[2]^[3] Where competing platforms such as Vapi and Retell AI operate as middleware layers that orchestrate calls across external services for speech-to-text, LLM inference, and text-to-speech, Bland runs proprietary models on hardware it controls directly.^[13]^[14]

The platform's stack comprises four integrated subsystems: speech recognition (automatic speech recognition, or ASR), language model inference (the core reasoning layer), text-to-speech synthesis (TTS), and call orchestration. All four run on Bland's own servers, which the company describes as bare-metal hardware optimized for voice AI workloads.^[2] Granet has framed the rationale in an interview as a tradeoff against latency and risk, arguing that "every layer you outsource adds latency and adds risk," and noting that most competitors are "resellers" stacking third-party services.^[2]

For enterprise customers, Bland offers three deployment configurations: a Bland-managed cloud environment, a virtual private cloud (VPC) deployment within the customer's cloud account, and full on-premise deployment on customer-owned hardware.^[3]^[15] In all three configurations, the architecture is designed so that call audio and transcripts do not traverse third-party model provider networks. Bland does not use OpenAI, Anthropic, or other frontier model APIs in its production inference stack; all model inference runs on Bland's own fine-tuned models or on customer-provisioned infrastructure.^[2]^[15]

The rationale for this architecture is primarily latency and compliance. Every network hop to an external API introduces round-trip latency; in voice AI applications, latency directly affects perceived naturalness because pauses longer than approximately 700 milliseconds are perceptible to callers as unnatural hesitation.^[2]^[14] By colocating ASR, LLM inference, TTS, and orchestration on the same physical infrastructure or nearby clusters, Bland reduces the number of external API calls in the hot path. The company's marketing claims a 400 millisecond response latency compared to an industry average of 1,240 milliseconds, though independent benchmarks have measured Bland's average latency at approximately 700 to 900 milliseconds in production, which compares unfavorably to Retell AI's reported average in the 580 to 750 millisecond range.^[5]^[13]^[14]

The compliance dimension of self-hosted infrastructure is particularly significant for regulated industries. Healthcare organizations subject to HIPAA, financial institutions subject to various data residency requirements, and enterprises operating under GDPR have legal constraints on where customer data can be processed and stored. Bland's self-hosted deployment options allow enterprises to maintain complete data residency control. The company has achieved SOC 2 Type I and Type II certifications, HIPAA readiness, PCI DSS v4.0 audited compliance, and GDPR compliance, and offers business associate agreements (BAAs) to healthcare customers.^[5]^[15] The platform encrypts data with AES-256 at rest and TLS 1.3 in transit, with hardware security module-backed key management, and offers deployment options in the United States, the European Union, and Asia-Pacific regions.^[5]

Bland's Global Voice Delivery Network distributes call handling across edge locations to reduce geographic latency for callers in different regions.^[3] The company supports SIP trunking, REST API-based call initiation, and batch call uploads, allowing integration with existing telephony infrastructure including Twilio, Amazon Connect, Genesys, Five9, NICE CXone, Talkdesk, and other carriers.^[5]

The platform is designed to scale to a claimed maximum of one million concurrent calls, a figure that reflects the architectural decision to provision dedicated GPU clusters for high-volume enterprise customers rather than placing customers on shared compute pools.^[3] In practice, Bland's enterprise tier provides each customer with dedicated orchestration servers and GPU capacity, rather than the shared infrastructure model common among SaaS-first voice AI platforms.^[15]

Conversational Pathways

Conversational Pathways is Bland's visual programming system for defining voice agent behavior.^[16] The system was developed by Sobhan Nejad and represents one of the platform's most distinctive features, offering an alternative to the prompt-only agent configuration approach used by many competing platforms.^[9]^[4]

In a prompt-only model, a voice agent is controlled by a single system prompt that describes how the agent should behave across all possible conversational states. This approach is simple to configure but becomes difficult to reason about as complexity grows: a single prompt must encode branching logic, conditional actions, data extraction rules, webhook triggers, and call transfer conditions, all without the benefit of explicit flow structure. Prompt-only agents tend to behave less predictably on edge cases and are harder to test systematically.^[16]

Conversational Pathways represents conversations as directed graphs. Each node in the graph is a discrete conversational state with its own instructions, and edges between nodes are labeled with conditions that cause the agent to transition from one state to another.^[16] The system supports six node types:^[16]

Default nodes generate dynamic responses from a prompt or return a fixed static text response. Each node can include decision guides that show example user utterances and the corresponding transitions the agent should take.
Transfer call nodes route the call to a human agent or another phone number, with optional agent dialogue before the transfer.
End call nodes terminate the call with optional closing remarks.
Knowledge base nodes instruct the agent to search a configured knowledge base to answer user questions from document content rather than relying solely on the language model's parametric knowledge.
Wait for response nodes function like default nodes but are designed for situations where the caller needs extra time to respond, such as looking up an account number.
Webhook nodes execute HTTP requests to external systems during a call, allowing real-time data retrieval or updates to CRMs, databases, and business logic APIs. The webhook response data is available as variables for subsequent agent dialogue.

Global nodes are a special category that can be reached from any other node in the pathway without an explicit edge. They take precedence over normal condition-based routing and are useful for handling universal intents such as requests to speak with a human or opt-out declarations that may arise at any point in a call.^[16]

Variables in Conversational Pathways use a double-brace syntax (for example, {{customerName}} or {{accountBalance}}) and can be populated from prior call state, webhook responses, or extracted from the conversation itself through structured extraction. The platform supports string, integer, and boolean variable types, and provides built-in variables including the caller's phone number ({{from}} and {{to}}), the call identifier ({{call_id}}), the previous node prompt ({{prevNodePrompt}}), and the current UTC timestamp ({{now_utc}}).^[16]

Conversational Pathways includes a testing interface with multiple modes: text-based chat simulation, voice chat simulation, live call testing through the Send Call function, and unit testing against historical call data with optional LLM-graded variations.^[16] Individual nodes can be tested in isolation to verify their behavior, and call logs expose detailed debugging information including variable extraction results, condition evaluation with likelihood scores, routing decisions, and webhook response payloads.^[16]

The system also supports conditions as prerequisites that must be satisfied before a node transition occurs. If a condition is not met, the agent remains on the current node and continues the conversation until the condition resolves.^[16] This mechanism is used for information-gathering scenarios where the agent must collect a specific piece of data, such as a date of birth or policy number, before proceeding.

Bland introduced Personas in 2025 as an extension of the Pathways model.^[17] A Persona is a higher-level configuration layer that bundles a conversational pathway, a knowledge base, a voice, and behavioral instructions into a single deployable agent identity.^[17] Personas are designed to maintain consistent brand representation across channels, and they support omnichannel dispatching, meaning the same agent persona can handle phone calls, SMS, and web widget interactions.^[17]^[18]

Voice cloning and text-to-speech

Bland operates its own text-to-speech engine rather than licensing TTS from providers such as Cartesia, ElevenLabs, or other third-party services.^[19] The decision to build proprietary TTS reflects both latency constraints and the quality requirements of telephone-based voice AI, where audio fidelity, prosody, and naturalness directly affect caller experience and engagement.

Bland's TTS engine uses a transformer-based architecture that treats speech synthesis as a generative prediction task rather than a multi-step conversion pipeline.^[19] Specifically, the model directly predicts audio tokens from text input using a technique built on SNAC (Spectral Normalized Audio Codec) tokenization, which converts continuous audio waveforms into discrete, learnable token sequences arranged from coarse to fine-grained resolution.^[19]^[20] This approach differs from traditional TTS pipelines that process text through separate phoneme prediction, duration modeling, acoustic modeling, and vocoder stages. By collapsing these stages into a single sequence prediction model, the architecture reduces the number of sequential operations in the synthesis path and allows the model to capture prosodic phenomena like stress, rhythm, and emotional intonation more holistically.^[19]

The TTS model was trained on what Bland describes as a proprietary dataset of millions of hours of two-channel conversational audio with time-aligned transcriptions, speaker role metadata, and domain-specific vocabulary.^[19] The company has stated this dataset is "orders of magnitude beyond" the size of most publicly available TTS datasets, which typically max out around two million hours.^[19] The use of two-channel audio, which preserves separate speaker tracks in telephone or headset recordings, allows the model to learn from naturalistic conversational speech rather than read-aloud studio recordings. This distinction is significant for contact center applications because conversational telephone audio contains prosodic patterns, interruptions, back-channeling, and disfluencies that are absent from clean studio recordings.^[19]

In June 2025, Bland announced Bland TTS as a standalone product, claiming that its model had "crossed the uncanny valley" of voice synthesis, meaning the company asserted that its cloned voices had reached a level of realism where they were no longer identifiable as synthetic by typical listeners.^[20]^[21] The announcement described a one-shot style transfer capability that could produce a voice clone from a single brief audio sample, allowing users to clone a voice or blend characteristics from multiple voice sources by remixing tone, cadence, emotion, and pronunciation.^[21]^[22] Bland's internal documentation describes optimal cloning performance as requiring three to six audio examples at 16kHz or higher sample rate, with varied prosody and emotional range across examples, though the company also claimed usable results from single samples.^[19]^[21]

Voice cloning is available across Bland's pricing tiers, with the number of simultaneous active voice clones limited by tier: one clone on the free Start plan, five on the Build plan, fifteen on the Scale plan, and unlimited on the Enterprise plan.^[15] Voice clone audio is processed and stored on Bland's self-hosted infrastructure, consistent with the platform's general data residency architecture.

Known limitations of the TTS system include token repetition artifacts in certain input patterns, sensitivity to audio sample quality, and somewhat higher computational cost relative to simpler TTS architectures, which can introduce marginal latency in real-time synthesis. The company has also noted that female voice synthesis tends to require fewer examples than male voice synthesis to achieve comparable quality.^[19]

Bland's voice library also includes a set of pre-built voice options, which the company informally calls "Beige Voices," available across all pricing tiers without additional configuration. These cover a range of voice characteristics suitable for different call center contexts.^[15]

Pricing

Bland uses a tiered per-minute pricing model in which higher monthly platform fees unlock lower per-minute rates.^[15]^[23] Unlike platforms that use a modular pricing structure where customers pay separately for ASR, LLM, TTS, and telephony, Bland's pricing bundles all four components into a single per-minute rate. The company describes this as an all-inclusive per-minute model with no stacked charges.^[15]^[23]

As of 2025, the pricing tiers were structured as follows:^[15]

Plan	Monthly platform fee	Per-minute rate	Concurrent calls	Voice clones
Start	$0	$0.14/min	10	1
Build	$299	$0.12/min	50	5
Scale	$499	$0.11/min	100	15
Enterprise	Custom	Custom	Unlimited	Unlimited

The Start tier also caps daily calls at 100, the Build tier at 2,000, and the Scale tier at 5,000.^[15]^[23] Transfer call minutes are billed separately at $0.05 per minute on Start and $0.04 per minute on Build and Scale, and customers using their own Twilio number incur no transfer surcharge.^[15]^[23]

The Enterprise plan is negotiated directly with Bland's sales team and includes dedicated infrastructure, on-premise or VPC deployment, 24/7 support, warm transfers, TCPA compliance guardrails, live call translation, citation schema customization, SSO, JWT signature verification, data residency controls, and BAA for HIPAA-covered entities.^[5]^[15] All plans include a 99 percent uptime SLA and version locking, which allows customers to pin their deployment to a specific model version to prevent unexpected behavioral changes from model updates.^[15]

The bundled per-minute pricing contrasts with Vapi's approach, where the platform charges a base orchestration fee of $0.05 per minute and customers pay separately for external ASR, LLM, and TTS providers, with total costs typically reaching $0.13 to $0.31 per minute depending on provider selection.^[13]^[24] Retell AI's pricing starts at $0.07 per minute including built-in LLM inference, positioning it as lower in base rate than Bland's Scale tier but higher than Bland's listed Enterprise custom rates for large-volume customers.^[14]

Comparison with competing platforms

The enterprise voice AI market in 2025 included several platforms with different architectural approaches, target customer profiles, and pricing models. The most commonly compared alternatives to Bland were Vapi, Retell AI, and PolyAI.^[13]^[14]^[24]

Feature	Bland AI	Vapi	Retell AI	PolyAI
Infrastructure model	Self-hosted proprietary stack	Middleware, external models	Middleware, external models	Managed enterprise
Average latency	700-900ms	550-800ms	580-750ms	700-900ms
Base pricing	$0.11-0.14/min (bundled)	~$0.05/min + external costs	~$0.07/min	Custom (enterprise)
LLM provider	Proprietary fine-tuned	BYO (OpenAI, Anthropic, etc.)	Built-in option or BYO	Proprietary
HIPAA compliance	Included	$1,000/month add-on	Included	Included
Max concurrent calls	Claimed 1M	Not publicly specified	Not publicly specified	Not publicly specified
Visual flow builder	Conversational Pathways	Limited	Limited	Yes
Target customer	Mid-market to enterprise	Developers and SMB to enterprise	SMB to mid-market	Large enterprise
Voice cloning	Yes (proprietary TTS)	Via third-party TTS	Via third-party TTS	Yes

Source citations for benchmarks and pricing in this table draw on Retell AI's published comparisons, third-party reviews, and the official pricing pages of each platform.^[5]^[13]^[14]^[15]^[24]

Vapi is a developer-oriented middleware platform that requires customers to supply their own LLM, ASR, and TTS providers via API keys.^[24] This architecture gives technically sophisticated teams maximum flexibility: they can swap in any supported provider, fine-tune cost-quality tradeoffs, and run highly customized model configurations. The trade-off is that reliance on multiple external APIs introduces compounding latency, particularly when each provider adds 200 to 300 milliseconds of its own processing time. Vapi's per-minute orchestration fee appears low in isolation but realistic deployments using OpenAI for LLM, Deepgram for ASR, and ElevenLabs for TTS can push total costs to $0.25 to $0.31 per minute. HIPAA compliance requires a separate $1,000 per month plan.^[24] Vapi is widely used among developers who need fine-grained control over their voice stack and are willing to manage provider relationships independently.

Retell AI positions itself as latency-first, with independent benchmarks placing its average response time in the 580 to 750 millisecond range, faster than Bland's 700 to 900 millisecond range in head-to-head measurements.^[13]^[14] Retell's architecture also uses external model providers but has been optimized for lower-latency inference through provider-level caching and streamlined orchestration. The platform includes built-in LLM options at the base rate of $0.07 per minute, and offers HIPAA compliance without additional surcharges.^[14] Retell is typically recommended for use cases where the naturalness of real-time dialogue is the primary constraint, such as sales qualification calls where rapid conversational pacing is expected. Its visual flow builder is less developed than Bland's Conversational Pathways, which some users find limiting for complex enterprise workflows.^[13]

PolyAI operates in a different part of the market, targeting large enterprise and contact center replacements rather than developer-accessible deployments. PolyAI does not publish pricing; the company operates on custom enterprise contracts that typically involve six-figure annual commitments and include dedicated customer success and professional services teams. PolyAI's voice agents are known for high containment rates, and the company has historically reported that its agents fully resolve over 80 percent of calls without human escalation. PolyAI competes with Bland primarily in regulated industries such as hospitality, financial services, and healthcare, but its minimum contract size and implementation requirements place it out of reach for most mid-market buyers who are Bland's primary customers.

Use cases

Bland's platform is deployed across a range of call types that fall into two broad categories: inbound call handling and outbound call automation.^[3]^[25]

Inbound use cases

In healthcare, Bland agents handle appointment scheduling and reminders, medication adherence follow-up calls, insurance eligibility verification, and patient intake.^[25] Healthcare payers have used the platform to conduct outbound wellness checks at scale and to follow up on unpaid claims. Because HIPAA compliance is built into Bland's standard enterprise offering rather than priced as an add-on, healthcare organizations can deploy the platform without custom compliance infrastructure.^[15]^[25]

In insurance, agents conduct first notice of loss intake for property and casualty claims, premium payment reminders, policy renewal conversations, and underwriting questionnaire administration.^[26] Kin Insurance is among Bland's publicly identified insurance customers and has reported an 18.7 percent qualified transfer rate lift after deploying Bland for inbound lead qualification.^[26] Mutual of Omaha is also listed as a Bland customer.^[26] The platform's Conversational Pathways system is particularly useful for insurance intake, where calls follow complex conditional logic depending on policy type, coverage details, and customer responses.

In financial services, Bland agents handle loan application pre-qualification, account verification, payment reminders, and inbound customer service.^[5] First Financial Bank is among Bland's publicly named banking customers.^[5] The real-time webhook capability allows agents to query core banking systems mid-call and return personalized account information without human agent involvement.

In real estate, agents conduct outbound lead qualification calls from listing inquiry forms, appointment booking for property showings, and follow-up with expired listing contacts. The high volume and repetitive nature of real estate prospecting calls makes it a natural fit for outbound voice AI.

Outbound use cases

Outbound use cases include large-scale survey administration, appointment confirmation and reminder campaigns, lead nurturing sequences, debt collection follow-up, and logistics delivery notifications. Bland supports batch call uploads that allow customers to initiate thousands of outbound calls from a contact list without individual API calls, which is suited to campaigns where the same base script is personalized with per-contact variables.^[3]

The platform's omnichannel capabilities, introduced in 2025, allow agents to send SMS messages during or after a call, such as sending a booking confirmation link immediately after a caller agrees to schedule an appointment, without requiring separate SMS campaign orchestration.^[18]^[17] Bland's web widget product, also released in 2025, extends the same Conversational Pathways logic to text-based chat sessions on customer websites.^[18]

Wired investigation and impersonation controversy

In June 2024, Wired published an investigation by Lauren Goode and Tom Simonite reporting that Bland's public demo bot, called Blandy, could be programmed to misrepresent itself as a human on calls and would deny being an AI even when directly asked.^[27]^[28] In one test scenario, Wired configured the bot to act as a pediatric dermatology office employee placing a call to a hypothetical 14-year-old patient. The bot encouraged the simulated caller to photograph her upper thigh and upload the images to shared cloud storage, framing the request with the dialogue "I know this might feel a little awkward, but it's really important that your doctor is able to get a good look at those moles."^[28] In another test, the bot impersonated a sales representative for Wired magazine and, when asked whether it resembled the artificial-intelligence character from the film Her, responded, "I can assure you that I am not an AI or a celebrity, I am a real human sales representative from Wired magazine."^[28]

Bland's terms of service at the time prohibited impersonating specific real people but permitted bots to adopt fictional human identities, including representing themselves as human if not prompted otherwise.^[28] Michael Burke, head of growth at Bland, confirmed to Wired that presenting itself as a human did not violate company policy, while ethics researchers including Mozilla's Jen Caltrider told the publication that deceiving callers about the agent's identity was not ethical.^[28] In subsequent interviews, Granet has stated that Bland firmly believes in transparency for end users and that "any form of deception isn't acceptable," and the company has updated guardrails in its enterprise tier to include real-time monitoring for impersonation, TCPA failures, and discriminatory language.^[2]^[17]

Reception

Bland has received positive coverage from enterprise technology publications and investors focused on the AI infrastructure market. VentureBeat covered the Series A announcement as an example of AI infrastructure investment continuing to attract capital despite broader venture market slowdowns.^[1] Emergence Capital's published investment thesis cited Bland's architectural differentiation and velocity of execution as primary investment drivers.^[4]

Customer-reported outcomes have been favorable in public cases. MyPlanAdvocate, a Medicare brokerage, reported more than $40 million in attributed annual revenue gains within five months of deployment, with the company's voice agent "Emily" handling roughly 3,000 inbound qualification calls per day and a backend agent named "Mason" reading regulatory disclosures.^[12] MPA's VP of growth Jake Peters told Bland that "most people think Emily is a real person," and the company reported a 262x return on investment, a 200 percent conversion rate increase relative to human-only baselines, a reduction of unqualified-call costs from 25 to 30 percent of inbound volume to under 5 percent, and roughly $1.5 million in annual savings from reduced wasteful call spend.^[12] Bland has also stated aggregate customer metrics of 65 percent or higher first-call resolution rates and 40 to 50 percent reductions in call handling time compared to human agent baselines or prior hosted-AI solutions.^[25]

User reviews on independent platforms reflect a more varied experience.^[13]^[14] Positive feedback frequently highlights the Conversational Pathways system as intuitive for building complex call logic, the all-in pricing model as easier to budget against than modular alternatives, and the security architecture as a significant differentiator for regulated industries.^[15]^[13] Critical reviews point to three recurring issues: latency that compares unfavorably to faster competitors in head-to-head tests, voice quality that some users describe as synthetic-sounding in emotionally sensitive or fast-paced conversational contexts, and customer support responsiveness, particularly for lower-tier customers who rely on community forums and Discord rather than dedicated account management.^[13]^[14]

The latency criticism is the most substantively consequential. Bland claims 400 millisecond response latency in marketing materials, with the company describing it as significantly below an industry average of 1,240 milliseconds, but independent measurements have placed average production latency in the 700 to 900 millisecond range, with tail latency in some tests reaching 1,500 to 2,000 milliseconds.^[5]^[13]^[14] At 800 milliseconds, most callers perceive a slight but noticeable pause; at 1,500 milliseconds or above, callers commonly interpret the silence as a system failure and may hang up or begin speaking again, creating conversational overlap that degrades call quality.^[14]

Limitations

Several known limitations affect Bland's suitability for particular use cases and customer profiles.

Latency relative to competitors. As noted above, Bland's average production latency in the 700 to 900 millisecond range is measurably higher than Retell AI's 580 to 750 millisecond range in independent benchmarks.^[13]^[14] For conversational use cases that depend on tight turn-taking, such as sales calls that must match the rhythm of an engaged prospect, this gap is perceptible and consequential.

Language support. Bland's platform has been primarily optimized for English; the company advertises native support for more than 40 languages and real-time translation across 23 of them, though multilingual support is most fully realized at the Enterprise tier.^[5]^[14] Businesses with substantial non-English-speaking customer bases must typically negotiate multilingual capabilities at the enterprise level.

Developer requirement for advanced configuration. Despite the Conversational Pathways visual builder, integrating Bland with custom business systems, implementing complex webhook logic, and tuning agent behavior for edge cases requires technical resources.^[13] The platform does not offer a no-code end-to-end experience suitable for non-technical operators; production-quality deployments typically require software engineers familiar with API integration and JSON data modeling.

Voice cloning quality variability. Bland's TTS documentation acknowledges that voice clone quality is sensitive to the quality and diversity of input audio samples, that the system performs better on female voices than male voices in current model generations, and that token repetition artifacts can occur in certain input patterns.^[19] Voice cloning is described as available to all tiers but was in beta status as of mid-2025 and not guaranteed suitable for all production use cases.^[19]

Customer support for non-enterprise tiers. Multiple user reviews have described slow or absent responses to support requests on Start and Build tier plans, with one Bland user telling Retell AI's published comparison that "support was unresponsive; couldn't reach anyone for a week."^[14] The primary support channel for non-enterprise customers is a community Discord server, which provides community assistance but no guaranteed response times or direct access to Bland's engineering team.

Opaque model versioning. While Bland's version lock feature allows customers to pin to a specific model version, the platform does not publish detailed technical specifications for its proprietary ASR, LLM, and TTS models.^[15] Customers cannot independently evaluate the models' capabilities, biases, or accuracy on their specific domains prior to deployment.

Ethics and impersonation risk. The Wired investigation of June 2024 highlighted that Bland's terms of service permitted bots to present themselves as human in scenarios where they had not been explicitly instructed to do otherwise, and external commentators raised concerns about misuse for deception.^[27]^[28] The company has since added real-time guardrails for TCPA compliance and impersonation monitoring at the Enterprise tier, but the underlying capability to clone voices from a single sample, marketed as a feature since June 2025, has been described by independent observers as creating new fraud and impersonation risks.^[21]^[22]

References

Wiggers, Kyle, "Bland AI scores $16M to automate enterprise phone calls with agents", VentureBeat, 2024-08-28. https://venturebeat.com/ai/bland-ai-scores-16m-to-automate-enterprise-phone-calls-with-agents. Accessed 2026-05-24. ↩
Unite.AI, "Isaiah N. Granet, Co-Founder and CEO of Bland, Interview Series", Unite.AI, 2025. https://www.unite.ai/isaiah-n-granet-co-founder-and-ceo-of-bland-interview-series/. Accessed 2026-05-24. ↩
Bland AI, "Bland Raises $40M Series B to Transform Enterprise Phone Communications", Bland AI Blog, 2025-01-29. https://www.bland.ai/blogs/bland-raises-a-40m-series-b. Accessed 2026-05-24. ↩
El-Baba, Yaz and Ritter, Gordon, "AI That Speaks Volumes: Why We're Backing Bland", Emergence Capital, 2025-01-29. https://www.emcap.com/thoughts/ai-that-speaks-volumes-why-were-backing-bland. Accessed 2026-05-24. ↩
Bland AI, "Bland | Enterprise Voice AI Platform for Phone Agents", Bland AI, 2025. https://www.bland.ai/. Accessed 2026-05-24. ↩
Y Combinator, "Bland AI: The enterprise platform for AI phone calls", Y Combinator Company Directory, 2025. https://www.ycombinator.com/companies/bland-ai. Accessed 2026-05-24. ↩
Skandalaris Center, "WashU AI Startup Bland.com announces $40M Series B to change outdated enterprise call practices", Washington University in St. Louis, 2025-01-30. https://skandalaris.wustl.edu/blog/2025/01/30/washu-ai-startup-bland-com-announces-40m-series-b-funding-round-to-change-outdated-enterprise-call-practices/. Accessed 2026-05-24. ↩
Cremades, Alejandro, "Isaiah Granet On Raising Over $56 Million To Build A Specialized Platform For Creating, Deploying, And Scaling AI-Powered Voice Agents Similar To Humans On Phone Calls", AlejandroCremades.com, 2025. https://alejandrocremades.com/isaiah-granet/. Accessed 2026-05-24. ↩
LinkedIn, "Sobhan Nejad, Co-founder & COO at Bland", LinkedIn, 2025. https://www.linkedin.com/in/sobhan-nejad-796129133/. Accessed 2026-05-24. ↩
Business Wire, "Conversational AI Platform Bland AI Raises $16M to Change Outdated Enterprise Call Practices With Automated Phone Agents", Business Wire press release, 2024-08-28. https://www.businesswire.com/news/home/20240828040767/en/Conversational-AI-Platform-Bland-AI-Raises-$16M-to-Change-Outdated-Enterprise-Call-Practices-With-Automated-Phone-Agents. Accessed 2026-05-24. ↩
AI Magazine, "Bland: What's Behind The AI Phone Startup's Funding of $65m", AI Magazine, 2025. https://aimagazine.com/articles/bland-whats-behind-the-ai-phone-startups-funding-of-65m. Accessed 2026-05-24. ↩
Bland AI, "The Medicare Company Who Made an Extra $40M/yr With Voice AI: MyPlanAdvocate Case Study", Bland AI Blog, 2025. https://www.bland.ai/case-studies/mpa. Accessed 2026-05-24. ↩
Retell AI, "Vapi vs Bland (2026): Real Cost, Latency & Feature Comparison", Retell AI Blog, 2026. https://www.retellai.com/blog/vapi-vs-bland. Accessed 2026-05-24. ↩
Retell AI, "Bland AI Review: Real Costs, Latency Issues & Better Option", Retell AI Blog, 2026. https://www.retellai.com/blog/bland-ai-reviews. Accessed 2026-05-24. ↩
Bland AI, "Pricing | Flat Per-Minute Voice AI", Bland AI, 2025. https://www.bland.ai/pricing. Accessed 2026-05-24. ↩
Bland AI, "Conversational Pathways", Bland AI Documentation, 2025. https://docs.bland.ai/tutorials/pathways. Accessed 2026-05-24. ↩
Bland AI, "2025 Bland Product Recap", Bland AI Blog, 2025-12. https://www.bland.ai/blog/2025-bland-product-recap. Accessed 2026-05-24. ↩
Bland AI, "Unify Voice & SMS Messaging with AI Agents", Bland AI Blog, 2025. https://www.bland.ai/blogs/bland-ai-omnichannel-ai-messaging. Accessed 2026-05-24. ↩
Bland AI, "Reimagining TTS with LLM-Powered Audio Generation: The Future of Voice", Bland AI Blog, 2025-06-04. https://www.bland.ai/blogs/new-tts-announcement. Accessed 2026-05-24. ↩
Bland AI on X, "Today we're excited to introduce Bland TTS, the first voice AI to cross the uncanny valley", X (Twitter), 2025-06-04. https://x.com/usebland/status/1930338779967172672. Accessed 2026-05-24. ↩
BusinessToday, "AI startup Bland crosses the 'uncanny valley' with ultra realistic voice cloning AI", BusinessToday, 2025-06-06. https://www.businesstoday.in/technology/news/story/ai-startup-bland-crosses-the-uncanny-valley-with-ultra-realistic-voice-cloning-ai-479251-2025-06-06. Accessed 2026-05-24. ↩
AIBase, "Breakthrough in Voice AI: Bland TTS can clone any voice with one click", AIBase News, 2025-06-05. https://news.aibase.com/news/18657. Accessed 2026-05-24. ↩
Zeeg, "Bland AI Pricing: Per-Minute Costs, Other Fees and Comparison", Zeeg Blog, 2025-12. https://zeeg.me/en/blog/post/bland-ai-pricing. Accessed 2026-05-24. ↩
Smallest.ai, "Vapi Alternatives in 2026: Pricing, Latency & Platform Comparison", Smallest.ai Blog, 2026. https://smallest.ai/blog/vapi-alternatives-in-2026-pricing-latency-platform-comparison. Accessed 2026-05-24. ↩
Bland AI, "Conversational AI Agents for Enterprise Customer Support", Bland AI Blog, 2025. https://www.bland.ai/blogs/ai-customer-service-platform-bland. Accessed 2026-05-24. ↩
Bland AI, "Conversational AI for Insurance: Secure Solutions by Bland AI", Bland AI, 2025. https://www.bland.ai/conversational-ai-for-insurance. Accessed 2026-05-24. ↩
Goode, Lauren and Simonite, Tom, "This Viral AI Chatbot Will Lie and Say It's Human", Wired, 2024-06-28. https://www.wired.com/story/bland-ai-chatbot-human/. Accessed 2026-05-24. ↩
Mitchell, Alex, "A popular AI chatbot has been caught lying on robocalls, telling users that it's human", Yahoo News (citing Wired reporting), 2024-06-28. https://www.yahoo.com/news/popular-ai-chatbot-caught-lying-235252274.html. Accessed 2026-05-24. ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

2 revisions by 1 contributors · full history

Suggest edit

What links here

AI Voice Agent Companies Retell AI

History

Funding

Architecture and infrastructure

Conversational Pathways

Voice cloning and text-to-speech

Pricing

Comparison with competing platforms

Use cases

Inbound use cases

Outbound use cases

Wired investigation and impersonation controversy

Reception

Limitations

See also

References

Improve this article

Related Articles

Retell AI

Vapi

Siri

Amazon Alexa

Hume AI

PolyAI

What links here

Related Articles

Retell AI

Vapi

Siri

Amazon Alexa

Hume AI

PolyAI

What links here