Google Meet AI Voice Translation

A concept and research project that designs real-time AI voice translation as a native part of Google Meet, so people who speak different languages can stay in the conversation.

MY ROLE

UX & Interaction Designer

MY TEAM

TIMELINE

3 Months

5 Research Participants

3 Core Features Designed

I/O 2025: Google validated this project

PROBLEM

Google Meet connects people across the world, but when conversations cross languages, the experience breaks down: users fall behind, lose context, and reach for Google Translate mid-call.

OUTCOME

Designed a privacy-first translation feature with live captions and post-call transcripts, built natively into Meet's interface, and prototyped it with AI-generated voice and video to make it feel real.

Context

This project was designed to live inside Google Meet: not a standalone translation tool, but a native feature that fits into how a call already works, built for the moment language becomes a barrier.

Stake

Design a translation feature people would actually trust and use mid-call.

Constraint

Concept project with no access to Google's design system or internal usage data.

Success

A feature that feels native to Meet, not bolted on.

Why Now?

Remote work made multilingual meetings the norm, but the tools never caught up.

1.8B Google Meet Users

1.8B+ people use Google Meet globally across work, education, and healthcare

Non-native English Speakers

Non-native English speakers make up the majority of Google Meet's user base

Google Translate Workaround

Existing workarounds like Google Translate require leaving the call entirely

Real-time AI translation

Real-time AI translation became technically feasible, but no platform had integrated it

User Research

Structured observation sessions and concept testing to understand where live calls break down across languages, and what users actually needed from a translation feature.

01 Community Signals

5 participants across 5 sessions

One participant stayed constant; the other changed each time, representing a different language or accent. Actions, reactions, pain points, and frustrations were recorded in real time.

02 Concept Testing

All 5 participants responded after seeing the feature

Introduced the translation concept post-session and collected direct feedback on expectations, concerns, and placement preferences.

03 HSE Principles Framework

5 principles applied: Perception, Attention, Cognitive Load, Expectations, User Control
Mapped observations to design decisions, turning behavioral patterns into structured design rationale.

04 Competitive Landscape

Tools analysed: Google Translate, Google Meet captions
Every workaround required leaving the call. No platform had natively integrated real-time voice translation, confirming a clear product gap.

Google Meet Captions

Google Translate

Final Research Impact: Every feature (consent first, live captions, post-call transcript) mapped directly to what participants said they needed.

Problem & Opportunity

Google Meet connects the world, but when conversations cross languages, the experience breaks: users lose context, fall behind, and reach for workarounds that pull them out of the call.

Before

Google Meet = Captions only, no translation

Missing

Native, real-time voice translation with user control

Opportunity

A privacy-first translation feature built into the call itself

Target User

People in multilingual meetings who need to communicate across language barriers without disrupting the flow of the call.

International Student

Remote-team Collaborator

Healthcare Professional

Cross-border Business User

User Core Needs

Minimal Delay

Accurate Translation

Privacy Control

One-click Access

Post-call Clarity

Product Goals & Strategy

The tradeoffs we optimised for:

Trade Off: Opt-in > Always On
Explanation: Defaulting to off builds trust faster than defaulting to on.

Trade Off: Native UI > Feature Rich
Explanation: Fewer controls, placed better, beat more controls buried deeper.

Ideation

One clear direction from the start: build translation natively into Meet, not around it.

Product Direction

Real-time AI voice translation

Live Captions

Consent Upfront

Post-call transcript

Final Plan

Research Insights

Consent Design

Live Translation

Post-call transcript

Prototype

Design System & Visual Language

Since Google Meet lives within the Google ecosystem, I worked within Material 3 guidelines, using Google Sans typography and Google's color system, so every feature felt like it already belonged.

Additions to Design System

Speaker and Listen Dropdowns

Current Google Meet Icons

AI Speaking Indicator

Request Participants Button

Live Transcript Labels

Voice Translation Toggle

Key Screens

Primary Consent

Permission Request

Shows exactly who requested translation and why, before anything is recorded

Language Pair

Speaker and listener languages set upfront, no surprises mid-call

Approve Request CTA

Nothing starts until the user actively confirms

Live Translation

Hear Translation Toggle

One-click control to turn audio translation on or off without leaving the call

Live Dual-Language Transcript

Both the original and translated text shown simultaneously, inline

Active Speaker Indicator

Blue ring signals who is currently being translated

Post-Call Transcript

Translation Summary Card

Surfaces automatically after the call, no searching for it

Privacy Reminder

Confirms transcript complies with privacy policies, reinforcing trust post-call

Download Transcript

Full bilingual transcript available immediately

Outcomes and Ownership

Outcomes

A concept and research project that went from structured user observation to a fully functioning AI voice prototype, validating a real product gap before Google shipped the same idea.

5 Research Participants

3 Core Features Designed

I/O 2025: Google validated the concept

Ownership

I led the end-to-end design, from research synthesis to interaction design, component decisions, and prototype production using Veo3 and ElevenLabs.

Ownership Flow

Research

HSE Framework

Interaction Design

System Design

Prototype

Validation

See you next time around!

Thank You

GET IN TOUCH

I'm currently open for exciting opportunities.
Let’s connect and talk about the next big thing!

✦ LET'S CHAT ✦ LET'S CHAT ✦ LET'S CHAT

dhvans © 2025
Designed with 🤍 by Dhvani Shah
