Speaker to Text (Speaker2Log)

The Speaker to Text feature (Speaker2Log) transcribes system audio output to text. It is useful for capturing and translating what other players say in VRChat voice chat.

Overview

Speaker2Log captures audio from your selected speaker/audio output device and:

Transcribes the audio to text
Translates the transcribed text (if translation is enabled)
Displays the results in the chat history

This allows you to "read" what others are saying through voice chat.
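
The three steps above can be pictured as a small pipeline. The following is a minimal sketch in Python, not VRCT's actual implementation: ChatEntry, process_chunk, and the transcribe and translate callables are hypothetical names that stand in for whichever transcription and translation engines are configured.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class ChatEntry:
    """One line in the chat history (hypothetical structure)."""
    original: str
    translated: Optional[str] = None


def process_chunk(
    audio: bytes,
    transcribe: Callable[[bytes], str],
    translate: Optional[Callable[[str], str]],
    history: List[ChatEntry],
) -> Optional[ChatEntry]:
    """Transcribe one chunk of speaker audio, translate it if enabled, and log it."""
    text = transcribe(audio).strip()
    if not text:
        # Silence or unintelligible audio: nothing to display.
        return None
    # Translation is optional; pass None when the Translation toggle is off.
    translated = translate(text) if translate is not None else None
    entry = ChatEntry(original=text, translated=translated)
    history.append(entry)
    return entry


# Stand-in engines, for illustration only.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_translate(text: str) -> str:
    return f"[translated] {text}"
```

Passing None for translate models the case where only Speaker2Log is enabled: the entry then carries the original text alone.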

How to Enable

Toggle Speaker2Log

In the main window, locate the Speaker2Log toggle switch
Click to enable speaker transcription
The toggle shows the on state while speaker transcription is running

Important

Make sure you have selected the correct speaker/audio output device in VRCT settings before enabling this feature.

Configuration

Select Speaker Device

Open the Config Window (click the gear icon)
Navigate to the Device section
Select your audio output device from the speaker dropdown list
Click Save

Tip

Select the same device that VRChat is using for audio output.

For detailed device configuration, see the Device Config Guide.

Adjust Transcription Settings

Open the Config Window
Navigate to the Transcription section
Configure speaker transcription settings

For detailed transcription settings, see the Transcription Config Guide.

How to Use

Basic Usage

Enable the Speaker2Log toggle
Wait for others to speak in VRChat voice chat, or for another application to play audio
VRCT captures and transcribes the audio
The transcribed text appears in the chat history (left side)

With Translation

Enable both Translation and Speaker2Log toggles
Set the expected source language
Set your preferred target language
When others speak, their speech will be transcribed and translated
View both original and translated text in the chat history

Features

Real-time Transcription

Audio is transcribed in near real-time as people speak.

Translation Integration

Works together with the Translation feature, so transcribed speech can be translated into your target language.

Important Warnings

Device Changes

Critical

If you change the speaker device in Windows while VRCT is transcribing speaker audio, VRCT may freeze or crash.

To safely change the speaker device:

Disable Speaker2Log in VRCT first
Change the speaker device in Windows
Update the speaker device in VRCT settings
Re-enable Speaker2Log
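
The ordering matters more than the individual steps: the device must only change while transcription is stopped. As a rough illustration of that rule, and not of how VRCT is built, the sketch below models it in Python. SpeakerTranscriber, paused, and switch_device are hypothetical names invented for this example.

```python
from contextlib import contextmanager
from typing import Iterator, List


class SpeakerTranscriber:
    """Hypothetical stand-in for the Speaker2Log toggle and its device setting."""

    def __init__(self, device: str) -> None:
        self.device = device
        self.enabled = False
        self.events: List[str] = []

    def enable(self) -> None:
        self.enabled = True
        self.events.append(f"enabled on {self.device}")

    def disable(self) -> None:
        self.enabled = False
        self.events.append("disabled")


@contextmanager
def paused(transcriber: SpeakerTranscriber) -> Iterator[SpeakerTranscriber]:
    """Stop transcription for the duration of the block, then restore it."""
    was_enabled = transcriber.enabled
    if was_enabled:
        transcriber.disable()
    try:
        yield transcriber
    finally:
        # Re-enable only if it was running before the change.
        if was_enabled:
            transcriber.enable()


def switch_device(transcriber: SpeakerTranscriber, new_device: str) -> None:
    """Change the device only while transcription is stopped."""
    with paused(transcriber) as t:
        # Corresponds to changing the device in Windows and then in VRCT settings.
        t.device = new_device
```

Calling switch_device on a running transcriber records the sequence disabled, then enabled on the new device, which mirrors the manual procedure above.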

Privacy Considerations

Be aware that Speaker2Log transcribes ALL audio from the selected device, including:

VRChat voice chat
Desktop audio
Music, videos, or other applications using that audio device

Best Practices

For Better Accuracy

Clear Audio Source: Ensure good audio quality from VRChat
Reduce Background Noise: Minimize desktop audio from other applications
Proper Volume Levels: Set VRChat voice volume to appropriate levels
Select Correct Language: Configure the expected language in settings

For VRChat Usage

Volume Balance: Keep VRChat voice volume above world audio and music so that speech dominates the captured audio
Monitor System Resources: Speaker transcription can be resource-intensive

Use Cases

Accessibility

Support players with hearing difficulties
Read voice chat as text
Keep conversation logs

Communication Support

Bridge language barriers in international worlds
Understand conversations in worlds where you don't speak the language
Keep written records of voice conversations

Troubleshooting

Speaker Transcription Not Working

Verify the correct audio device is selected in settings
Check that the device is not muted in Windows
Ensure VRChat audio is playing through the selected device
Test audio playback in Windows

Poor Transcription Accuracy

Increase VRChat voice volume
Reduce background application audio
Select the correct source language
Try a different transcription engine
Check audio output quality

VRCT Freezing

Disable Speaker2Log before changing Windows audio devices
Restart VRCT if frozen
Check system resources (CPU/GPU/RAM)

Cannot Hear Audio

Check Windows audio settings
Verify VRChat audio output device
Check application mixer volumes

Performance Considerations

System Resources

Speaker transcription requires:

Continuous CPU/GPU processing
RAM for audio buffering
More resources than microphone transcription, because it processes all audio on the selected device

Optimization Tips

Use lighter transcription models
Close unnecessary audio applications
Disable when not needed
Balance transcription quality against performance in the settings

Privacy & Ethics

Respect Others' Privacy

Be mindful that you're transcribing others' conversations
Use this feature responsibly and ethically
Follow VRChat Terms of Service and community guidelines
Don't share transcribed private conversations without permission

Data Handling

Transcribed text is stored locally in VRCT
Cloud-based engines may send audio data to external servers
Choose local engines if privacy is a concern

Related Features

Voice to Text - Transcribe your own voice
Receive Message - Learn about receiving messages
Real-time Translation - Translate transcribed text
Device Config - Configure speaker device
Transcription Config - Configure transcription engine
