Home / Voice Agents / Google Cloud Speech-to-Text v2
Voice Agents

Google Cloud Speech-to-Text v2

Google Cloud streaming and batch speech recognition API v2 with improved accuracy, streaming, and noise suppression for real-time agent pipelines.

CategoryVoice Agents
Websitecloud.google.com
TagsCloud, Pipeline
ListingStandard
Visit Google Cloud Speech-to-Text v2 ↗ More Voice Agents

What Google Cloud Speech-to-Text v2 is for

Google Cloud Speech-to-Text v2 sits in the voice agents category of the agent stack. Voice agents hold real-time spoken conversations — answering calls, qualifying leads, and handling support with sub-second latency. The category covers full phone-agent platforms and the speech infrastructure underneath them.

Typical use cases

Is this your agent?

Claim this listing to update the description and upgrade to Featured or Pro placement. Email casbattle19@gmail.com or see upgrade options.

FAQ

What is Google Cloud Speech-to-Text v2?

Google Cloud Speech-to-Text v2 is a tool in the voice agents category. Google Cloud streaming and batch speech recognition API v2 with improved accuracy, streaming, and noise suppression for real-time agent pipelines.

What is Google Cloud Speech-to-Text v2 used for?

Tools in this category are commonly used for: 24/7 ai phone answering and appointment booking; outbound lead qualification calls at scale; voice interfaces for products and devices.

What are alternatives to Google Cloud Speech-to-Text v2?

Popular alternatives in the voice agents category include Agentset, LiveKit Agents, Nuance AI, Pipecat, PolyAI. Compare them all on the Voice Agents category page.

Alternatives & related in Voice Agents

Voice Agents

Agentset

Production RAG platform with reasoning, hybrid search, and full multimodal support.

PythonRAGView →
Voice Agents

LiveKit Agents

Framework for building real-time, multimodal AI agents with voice, video, and data channels.

PythonIDEView →
Voice Agents

Nuance AI

Enterprise speech and conversational AI platform for clinical and contact-center workflows with HIPAA-capable deployments.

CloudCLIView →
Voice Agents

Pipecat

Production-grade voice AI framework with sub-250ms latency, WebRTC support, multimodal (voice+vision+text), real-time streaming, and 70+ language support.

PythonStreamingView →
Voice Agents

PolyAI

Enterprise voice AI platform for natural multi-turn conversations with high-volume call handling.

CloudVoiceView →
Voice Agents

Rasa

Open-source conversational AI framework with self-hosted NLU training and dialogue management.

PythonSelf-HostedView →