Voice Calls
Start voice calls in channels with LiveKit-powered audio and AI agent participation
Voice calls in Saltare are real-time audio conversations attached to channels. They use LiveKit for WebRTC audio, and — uniquely — AI agents can join calls, listen via transcription, and respond with synthesized speech.
Starting a Call
To start a voice call, click the phone icon in any channel header. When you initiate a call:
- A call room is created and you join immediately
- All other channel members receive an incoming call notification
- A system message is posted in the channel announcing the call
- Members have 30 seconds to join before the call times out (if you're the only participant)
Joining and Declining
When someone starts a call in a channel you belong to, you'll see a ringing indicator with options:
- Accept — Join the call immediately
- Decline — Dismiss the notification (the caller sees you declined)
You can also join an active call at any time by clicking the call indicator in the channel.
During a Call
Controls
- Mute / Unmute — Toggle your microphone
- Leave — Exit the call (the call continues for other participants)
- End Call — End the call for everyone (only the initiator can do this)
Call Timer
A timer shows how long the call has been active, displayed in the call panel.
Participant List
See who's on the call with speaking indicators that show who's currently talking.
Adding AI Agents
The most powerful feature of Saltare voice calls is agent participation. You can add any workspace agent to an active call.
How It Works
- Click "Add Agent" during an active call
- Select an agent from your workspace
- The agent joins and begins listening via speech transcription
What Happens Behind the Scenes
- Your speech is transcribed using the Web Speech API in your browser
- Transcriptions are sent to the agent in real-time
- The agent processes your words, selects tools, and generates a response
- The response is converted to audio via text-to-speech
- Audio is played back to all call participants
Use Cases
- Quick research — "Hey @Researcher, what's Acme Corp's latest funding round?"
- Live task creation — "Create a task for the items we just discussed"
- Meeting notes — Agents listen and can summarize the conversation afterward
- Data lookup — "What does our sales database show for Q2?"
Removing Agents
Click "Remove Agent" to disconnect an agent from the call. Transcription stops for that agent.
Call Lifecycle
RINGING → Waiting for participants (30s timeout)
ACTIVE → Call in progress with connected participants
ENDED → Call terminated (by initiator or last human leaving)
Important behaviors:
- If the last human participant leaves, the call ends automatically (agents alone don't keep a call alive)
- The initiator can end the call for everyone at any time
- Calls that go unanswered for 30 seconds are marked as missed
Post-Call Processing
After a call ends, Saltare can generate:
Transcripts
A timestamped record of everything said during the call, attributed to each speaker. Transcripts are stored as documents for future reference.
Summaries
Use the summarize_voice_call agent tool to get a structured summary of what was discussed:
@Assistant summarize the voice call we just had in #engineering
The agent pulls the transcript and produces a summary with key decisions, action items, and topics covered.
System Messages
Voice calls post system messages in the channel:
- When a call starts
- When participants join or leave
- When the call ends (with duration)
This keeps an audit trail of calls in the channel history.
Plan Requirements
Voice calls require the Pro plan or higher:
- Voice Calls — Available on Pro, Business, and Enterprise
- Agent Participation — Also requires Pro or higher
Starter plan users will see an upgrade prompt when trying to start a call.
Tips
- Use agents for meeting notes — Add an agent at the start of important calls so it captures everything
- Keep calls in relevant channels — Calls are attached to channels, so start calls in the channel that matches the topic
- Review transcripts — Post-call transcripts are valuable for team members who couldn't attend
- Quick questions — Voice calls with agents are faster than typing for complex, conversational queries