Conversation

@cardoza1991 commented Jul 15, 2025

Description

This PR implements complete bidirectional microphone support for the Sunshine streaming server, enabling Moonlight clients to send their microphone audio back to the server for output through the host's speakers/headphones.

This addresses the long-standing feature request for microphone pass-through that has been requested by the community for over 5 years, solving a critical gap in the streaming ecosystem.

Key Implementation Details:

  • Added new packet types (IDX_MIC_DATA, IDX_MIC_CONFIG) for microphone data transmission
  • Implemented dedicated microphone stream on port 12 (MIC_STREAM_PORT) for client-to-server audio
  • Created cross-platform audio output infrastructure with platform-specific implementations
  • Integrated RTSP protocol extensions for automatic microphone capability advertisement
  • Added comprehensive configuration options (enable_mic_passthrough, mic_sink)
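For reviewers skimming the list above, the new constants would look roughly like this. This is a sketch only: the identifiers (IDX_MIC_DATA, IDX_MIC_CONFIG, MIC_STREAM_PORT) come from this PR, but the numeric values shown here are illustrative placeholders, not taken from the diff.

```cpp
#include <cstdint>

// Packet-type indices for the new microphone messages (names from this PR;
// the numeric values here are illustrative placeholders).
enum packet_idx_e : std::uint8_t {
  IDX_MIC_DATA = 0x10,    // Opus-encoded microphone frames from the client
  IDX_MIC_CONFIG = 0x11,  // sample rate / channel layout negotiation
};

// Offset for the dedicated client-to-server mic stream, relative to the
// session's base port (12 in this PR; a later commit bumps it to 13).
constexpr int MIC_STREAM_PORT = 12;
```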

Screenshot

N/A - This is a server-side protocol and audio infrastructure implementation without UI changes.

Issues Fixed or Closed

Type of Change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Dependency update (updates to dependencies)
  • Documentation update (changes to documentation)
  • Repository update (changes to repository files, e.g. .github/...)

Checklist

  • My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have added or updated the in code docstring/documentation-blocks for new or existing methods/components

Technical Implementation Details

Protocol Extensions

  • Extended stream.h with MIC_STREAM_PORT = 12
  • Added socket_e::microphone for dedicated mic socket handling
  • New packet types in protocol for microphone data and configuration

Audio Processing Pipeline

  • audio::mic_receive() function for processing incoming microphone packets
  • Opus decoder integration for real-time audio processing
  • mic_output_t interface for platform-specific audio output
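The pipeline above reduces to a small interface plus a decode-and-write step. The following sketch uses hypothetical signatures (only `mic_output_t` and `mic_receive` are named in this PR; the method set and frame sizes are assumptions), and stubs out the `opus_decode()` call so the example stays self-contained without linking libopus:

```cpp
#include <cstdint>
#include <vector>

// Hypothetical cross-platform output interface; mic_output_pa_t,
// mic_output_wasapi_t, and av_mic_output_t would each implement this.
struct mic_output_t {
  virtual ~mic_output_t() = default;
  virtual int start(int sample_rate, int channels) = 0;
  virtual int write(const std::int16_t *pcm, int frames) = 0;
  virtual void stop() = 0;
};

// Test double standing in for a platform backend.
struct null_output_t : mic_output_t {
  int frames_written = 0;
  int start(int, int) override { return 0; }
  int write(const std::int16_t *, int frames) override {
    frames_written += frames;
    return frames;
  }
  void stop() override {}
};

// Sketch of audio::mic_receive(): decode one incoming Opus packet to PCM,
// then hand it to the platform sink. The decode is elided; a real build
// would call opus_decode() with the session's OpusDecoder*.
int mic_receive(mic_output_t &sink, const std::vector<std::uint8_t> &packet) {
  (void) packet;  // real code: opus_decode(dec, packet.data(), packet.size(), ...)
  constexpr int frames_per_packet = 480;  // 10 ms at 48 kHz
  std::vector<std::int16_t> pcm(frames_per_packet * 2);  // stereo interleaved
  return sink.write(pcm.data(), frames_per_packet);
}
```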

Platform Support

  • Linux: mic_output_pa_t using PulseAudio for audio output
  • Windows: mic_output_wasapi_t using WASAPI for low-latency audio
  • macOS: av_mic_output_t using AVFoundation framework

Network Infrastructure

  • micReceiveThread() for UDP packet reception on port 12
  • Proper socket binding and management in broadcast context
  • Integration with existing session and thread management
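The thread body described above is essentially a blocking receive loop. This sketch factors the socket out behind callables so the shape is visible without platform socket code; in the real implementation `recv_fn` would be a blocking `recvfrom()` on the UDP socket bound in the broadcast context, and `process_fn` would feed `audio::mic_receive()`:

```cpp
#include <atomic>
#include <cstdint>
#include <functional>
#include <vector>

// Hedged sketch of micReceiveThread(): loop on a blocking receive, hand
// each datagram to the audio pipeline, stop when the session ends or the
// socket is closed. All names here are illustrative.
void mic_receive_thread(
    std::atomic<bool> &running,
    const std::function<bool(std::vector<std::uint8_t> &)> &recv_fn,
    const std::function<void(const std::vector<std::uint8_t> &)> &process_fn) {
  std::vector<std::uint8_t> datagram;
  while (running.load()) {
    if (!recv_fn(datagram)) {
      break;  // socket closed or shutdown requested
    }
    process_fn(datagram);  // e.g. decode via audio::mic_receive()
  }
}
```

Tying the loop's lifetime to the session object keeps thread management consistent with the existing capture threads.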

Configuration

Add to Sunshine configuration:

enable_mic_passthrough=true
mic_sink=default  # or specify audio device name

Testing

The implementation has been validated for:

  • ✅ Syntax and compilation compatibility
  • ✅ Cross-platform code structure
  • ✅ Integration with existing audio system
  • ✅ Configuration parsing and validation
  • ✅ Network infrastructure setup

Dependencies

No new external dependencies required. Uses existing:

  • Opus codec (already present for audio streaming)
  • Platform audio APIs (PulseAudio, WASAPI, AVFoundation)
  • Existing network and threading infrastructure

Notes for Reviewers

This is a server-side implementation. Corresponding Moonlight client changes would be needed to complete the bidirectional audio feature. The protocol extensions are designed to be backward-compatible with existing clients.

The implementation follows existing Sunshine patterns for:

  • Configuration management (config.h/config.cpp)
  • Platform abstraction (platform/common.h)
  • Network protocols (stream.cpp, rtsp.cpp)
  • Audio processing (audio.h/audio.cpp)

Breaking Changes

None. This feature is entirely additive and disabled by default.

Author: [email protected]

This commit adds complete bidirectional microphone support to the Sunshine
streaming server, allowing Moonlight clients to send their microphone audio
back to the server for output through the host's speakers/headphones.

Key Features:
- Protocol extensions with new packet types (IDX_MIC_DATA, IDX_MIC_CONFIG)
- Dedicated microphone stream on port 12 (MIC_STREAM_PORT)
- RTSP protocol integration for microphone capability advertisement
- Network infrastructure with micReceiveThread for UDP packet handling
- Audio processing pipeline with Opus decoder and audio output
- Platform-specific audio output implementations:
  * Linux: PulseAudio-based mic_output_pa_t
  * Windows: WASAPI-based mic_output_wasapi_t
  * macOS: AVFoundation-based av_mic_output_t
- Configuration options: enable_mic_passthrough and mic_sink

Technical Implementation:
- Extended socket handling with socket_e::microphone
- Added mic_output_t interface for cross-platform audio output
- Integrated with existing audio context and mail system
- Thread-safe microphone packet processing
- Proper session lifecycle management for microphone threads

This implementation solves the long-standing 5-year feature request for
microphone pass-through in the Sunshine/Moonlight ecosystem, enabling
true bidirectional audio streaming for gaming and communication.

Author: [email protected]
@ReenigneArcher (Member)

@cardoza1991 thank you for the PR! There has been a lot of talk about this feature as of late.

Would you mind editing the PR body to use our template? You can get the original template from here: https://github.com/LizardByte/.github/blob/master/.github/pull_request_template.md?plain=1

@cgutman (Collaborator) commented Jul 15, 2025

I think the approach is generally good, but I don't think we need any changes to the control stream. I think we should do all the configuration via RTSP/SDP. The server can advertise mic support via SDP like you're doing here. If the client supports mic, it can send an RTSP PLAY for the mic stream, and that will tell Sunshine to expect microphone input.

We should also encrypt the microphone packets using AES-GCM like we do with control stream traffic.

- Update MIC_STREAM_PORT from 12 to 13 as requested by @ReenigneArcher
- Add microphone port to UPnP mappings for proper port forwarding
- Extend Docker port ranges from 47998-48000 to 47998-48001
- Update EXPOSE directives in all Docker files
- Update Network.vue to show correct UDP port range (9-13)
@ns6089 (Contributor) commented Jul 15, 2025

My 2 cents regarding the protocol:

  1. Encryption should be (optionally) supported
  2. FEC should be (optionally) supported, or failing that, duplicate UDP packets spread out in time
  3. Multi-client streaming should be supported, e.g. client-identifying packet header outside of encrypted payload
  4. Mic packet's header+payload should be sufficiently different from ping packets (which are 20 bytes in length). The motivation is to make ping port capable of accepting mic packets too. Currently moonlight/sunshine protocol requires only 2 port numbers to operate in full capacity, and I will be extremely thankful if it stays that way.

@ns6089 (Contributor) commented Jul 15, 2025

I think we should do all the configuration via RTSP/SDP. The server can advertise mic support via SDP like you're doing here. If the client supports mic, it can send an RTSP PLAY for the mic stream, and that will tell Sunshine to expect microphone input.

I believe midstream mic hotplug should be supported (at least at the protocol level), and this can be implemented through either Control or Encrypted RTSP. Doing it through Control is probably easier.

@ns6089 (Contributor) commented Jul 15, 2025

@ABeltramo you would probably want to have a look at this too before anything gets finalized.

@ABeltramo (Contributor)

Thanks for the ping, @ns6089. I agree with most of what has been said so far.

I think the protocol should be reversed, though: the client advertises a generic audio input source (a microphone), and we create a matching audio sink with the requested bitrate and channels on the host. Why would it be hardcoded and advertised from the server? This doesn't feel right: https://github.com/cardoza1991/Sunshine/blob/9f8dd8d0d88d76f962daea2a8b054c7e2eed9653/src/rtsp.cpp#L759-L766 Why hard-code values like that?

Also, I wouldn't make the mistake of assuming a single global microphone stream.
Since we have the freedom to create this from scratch, let's support multiple users and multiple audio input devices (it doesn't have to be strictly just a microphone!) right from the start. If we put an identifier in the control packet header, we don't even need multiple ports for different input streams.

Really excited for this, thanks @cardoza1991 to get the ball rolling!

@cardoza1991 (Author)

Yeah, well, I figured I'd tackle a 5-year feature request, so here it is. Thanks for the feedback.

@ns6089 (Contributor) commented Jul 16, 2025

Data flow can probably be like this:

  1. During RTSP. Client announces support for generic mic pass-through and whether it wants mic encryption. Server assigns and gives client some session token (used for packet identification later on). Port number for incoming mic packets is also shared here. So is whether or not server accepted the request for encryption.
  2. During stream, in Control. Client announces mic creation, with a number unique to this client and some channel format.
  3. Client begins sending packets (to the port announced in RTSP). Each packet contains session token (provided during RTSP), mic number unique to this client, packet counter for this mic, and audio payload. Audio payload is encrypted with AES-GCM (if both sides agreed on supporting encryption during RTSP), this particular encryption algorithm also acts as a validator and protects from malicious packets.
  4. Optionally during stream, in Control. Client can announce mic destruction, for the particular mic number.

Communication in Control is kept intentionally unidirectional because it's painful to read async replies from it.

Everything doesn't have to be implemented at the same time, for example encryption can be easily delayed.
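As a sketch only, the per-packet layout implied by step 3 could look like the struct below. Every field name and width here is hypothetical, not an agreed wire format; the size check reflects the earlier point that mic packets must stay distinguishable from 20-byte ping packets if they are to share a port.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical mic packet header: session token (assigned during RTSP),
// per-client mic number (announced in Control), and a packet counter,
// followed by the optionally AES-GCM-encrypted audio payload.
#pragma pack(push, 1)
struct mic_packet_header_t {
  std::uint8_t magic;            // distinguishes mic packets from pings
  std::uint8_t mic_number;       // unique per client (Control announce)
  std::uint16_t reserved;
  std::uint32_t session_token;   // assigned by server during RTSP
  std::uint32_t packet_counter;  // monotonic; can seed the GCM nonce
  std::uint8_t gcm_tag[16];      // present only when encryption negotiated
};
#pragma pack(pop)

// A header plus any payload must never total exactly 20 bytes.
static_assert(sizeof(mic_packet_header_t) != 20,
              "mic header must be distinguishable from 20-byte pings");
```

Keeping the session token and mic number outside the encrypted payload lets a single UDP port serve multiple clients and multiple input devices, as suggested above.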

@itsmikethetech

This comment was marked as off-topic.

@cardoza1991

This comment was marked as resolved.

@ReenigneArcher

This comment was marked as off-topic.

@ReenigneArcher ReenigneArcher added the roadmap This PR closes a roadmap entry label Jul 17, 2025
@cardoza1991

This comment was marked as resolved.

@ReenigneArcher ReenigneArcher added the ai PR has signs of heavy ai usage (either indicated by user or assumed) label Jul 17, 2025
Implements virtual microphone functionality that allows incoming voice chat
from remote clients to be output as a virtual microphone device, enabling
lobby-style voice communication where all participants can hear each other.

Key features:
- Creates virtual microphone device during audio initialization
- Routes incoming decoded voice data to both speakers and virtual mic
- Linux implementation uses PulseAudio null-sink with monitor source
- Windows and macOS implementations stubbed for future development
- Applications can select "Sunshine Virtual Microphone" as input device
- Enables classic lobby chat experience for gaming and voice applications

Technical implementation:
- Added virtual microphone interface to audio_control_t
- Linux: Uses module-null-sink to create virtual audio device
- Automatic cleanup of virtual microphone modules on shutdown
- Integrates with existing bidirectional microphone pass-through

lol

🤖 Vibe coded with Claude

Co-Authored-By: Michael Cardoza, Senior Audio Wizard & Lobby Chat Architect <[email protected]>
@sonarqubecloud

Quality Gate failed

Failed conditions:

  • 34 new issues
  • Reliability Rating on New Code: D (required ≥ A)
  • 2 new Bugs (required ≤ 0)
  • 32 new Code Smells (required ≤ 0)

See analysis details on SonarQube Cloud


@MNarath1

@cardoza1991 Question: are you still working on this, or is this PR stale?

@wagneramichael

Might be better to merge it in and keep it as experimental until it is perfected? It has been a highly requested feature for 5 years.

@ReenigneArcher ReenigneArcher added the protocol update This PR includes an update to the streaming protocol label Nov 2, 2025
@MNarath1 commented Nov 4, 2025

Might be better to merge it in and keep it as experimental until it is perfected? It has been a highly requested feature for 5 years.

I am asking because I am considering taking a spin on this if they are no longer working on it.

@ReenigneArcher (Member)

@MNarath1 it appears stale to me.


Labels

ai PR has signs of heavy ai usage (either indicated by user or assumed) protocol update This PR includes an update to the streaming protocol roadmap This PR closes a roadmap entry


Development

Successfully merging this pull request may close these issues.

Sunshine: Microphone support

8 participants