Go to file

alexei.dolgolyov 3cfc437599 Add UI animations: dialogs, tabs, settings, browser stagger, banner pulse

- Dialog modals: scale+fade entrance/exit with animated backdrop
- Tab panels: fade-in with subtle slide on switch
- Settings sections: content slide-down on expand
- Browser grid/list items: staggered cascade entrance animation
- Connection banner: slide-in + attention pulse on disconnect
- Accessibility: prefers-reduced-motion disables all animations

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-03-08 19:45:02 +03:00

docs

Update media browser screenshot

2026-02-25 01:56:38 +03:00

media_server

Add UI animations: dialogs, tabs, settings, browser stagger, banner pulse

2026-03-08 19:45:02 +03:00

scripts

Add primary display indicator, custom accent color picker, restart script

2026-02-27 16:18:18 +03:00

.gitignore

Add media browser feature with UI improvements

2026-02-06 21:31:02 +03:00

CLAUDE.md

Add media browser feature with UI improvements

2026-02-06 21:31:02 +03:00

config.example.yaml

Add multi-token authentication with client labels

2026-02-06 03:37:35 +03:00

pyproject.toml

Add audio visualizer with spectrogram, beat-reactive art, and device selection

2026-02-27 21:42:19 +03:00

README.md

Add PWA support: installable standalone app with safe area handling

2026-03-01 13:17:56 +03:00

README.md

Media Server

A REST API server for controlling system media playback on Windows, Linux, macOS, and Android.

Features

Built-in Web UI for real-time media control and monitoring
Installable PWA - Add to home screen on mobile for a native app experience
Audio Visualizer - Real-time spectrum analyzer with beat-reactive album art effects
Media Browser - Browse and play media files from configured folders
Display Control - Monitor brightness and power management
Quick Actions & Scripts - Execute custom scripts with one click
Callbacks - Trigger commands on media events (play, pause, volume, etc.)
Control any media player via system-wide media transport controls
Play/Pause/Stop/Next/Previous track
Volume control and mute
Seek within tracks
Get current track info (title, artist, album, artwork)
WebSocket support for real-time updates
Token-based authentication with multi-token support
Dark/light theme with customizable accent colors
Multi-language support (English, Russian)
Cross-platform support (Windows, Linux, macOS, Android)

Web UI

The media server includes a built-in web interface for controlling and monitoring media playback.

Features

Real-time status updates via WebSocket connection
Album artwork display with glow effect and automatic updates
Vinyl record mode - Album art displayed as a spinning vinyl disc with grooves and center spindle
Playback controls - Play, pause, next, previous
Volume control with mute toggle
Seekable progress bar - Click to jump to any position
Mini player - Sticky compact player that appears when scrolling away from the main player
Connection status indicator - Know when you're connected
Token authentication - Saved in browser localStorage
Audio spectrum visualizer - Real-time frequency bars with beat-reactive album art scaling and glow (on-demand WASAPI loopback capture)
Display control - Monitor brightness adjustment and power on/off
Installable PWA - Add to home screen on mobile/desktop for standalone app experience with safe area support for notched phones
Responsive design - Works on desktop, tablet, and mobile
Dark and light themes - Toggle between dark and light modes with dynamic status bar theming
Accent color picker - Choose from 9 preset accent colors or pick a custom color
Tab-based navigation - Player, Display, Browser, Quick Actions, and Settings tabs
Multi-language support - English and Russian locales with automatic detection

Accessing the Web UI

Start the media server:
```
python -m media_server.main
```
Open your browser and navigate to:
```
http://localhost:8765/
```
Enter your API token when prompted (get it with media-server --show-token)
Start playing media in any supported player and watch the UI update in real-time!

Installing as a PWA

The Web UI can be installed as a Progressive Web App for a native app-like experience:

Open the Web UI in Chrome/Edge on your phone or desktop
Tap the Install icon in the address bar (or "Add to Home Screen" on mobile)
The app launches in standalone mode — no browser chrome, with proper safe area handling for notched phones

Audio Visualizer

The Web UI includes a real-time audio spectrum visualizer that captures system audio output:

On-demand capture - Audio capture starts only when a client enables the visualizer, and stops when the last client disconnects
Beat-reactive effects - Album art pulses and glows in response to bass frequencies
Configurable device - Select which audio output device to capture in Settings

Requires soundcard and numpy Python packages. Enable in config.yaml:

visualizer_enabled: true
# visualizer_device: "Speakers"  # optional: specific device name

Remote Access

To access the Web UI from other devices on your network:

Find your computer's IP address (e.g., 192.168.1.100)
Navigate to http://192.168.1.100:8765/ from any device on the same network
Enter your API token

Security Note: For remote access over the internet, use a reverse proxy with HTTPS (nginx, Caddy) to encrypt traffic.

Localization

The Web UI supports multiple languages with automatic browser locale detection:

Available Languages:

English (en) - Default
Русский (ru) - Russian

The interface automatically detects your browser language on first visit. You can manually switch languages using the dropdown in the top-right corner of the Web UI.

Contributing New Locales:

We welcome translations for additional languages! To contribute a new locale:

Copy media_server/static/locales/en.json to a new file named with your language code (e.g., de.json for German)
Translate all strings to your language, keeping the same JSON structure

Add your language to the supportedLocales object in media_server/static/index.html:

const supportedLocales = {
    'en': 'English',
    'ru': 'Русский',
    'de': 'Deutsch'  // Add your language here
};

Test the translation by switching to your language in the Web UI
Submit a pull request with your changes

See CLAUDE.md for detailed translation guidelines.

Media Browser

The Media Browser feature allows you to browse and play media files from configured folders directly through the Web UI.

Browser Features

Folder Configuration - Mount multiple media folders (music/video directories)
Recursive Navigation - Browse through folder hierarchies with breadcrumb navigation
Multiple View Modes - Grid, compact grid, and list views with toggle buttons
Thumbnail Display - Automatically generated thumbnails from album art (lazy-loaded)
Metadata Extraction - View title, artist, album, duration, bitrate, file size, and more
Remote Playback - Play files on the PC running the media server (not in the browser)
Play All - Play all media files in the current folder
File Download - Download individual media files directly from the browser
Search & Filter - Real-time search across files in the current folder
Pagination - Navigate large folders with configurable page sizes (25, 50, 100, 200, 500)
Last Path Memory - Automatically returns to your last browsed location

Configuration

Add media folders in your config.yaml:

# Media folders for browser
media_folders:
  music:
    path: "C:\\Users\\YourUsername\\Music"
    label: "My Music"
    enabled: true
  videos:
    path: "C:\\Users\\YourUsername\\Videos"
    label: "My Videos"
    enabled: true

# Thumbnail size: "small" (150x150), "medium" (300x300), or "both"
thumbnail_size: "medium"

How Playback Works

When you play a file from the Media Browser:

The file is opened using the default system media player on the PC running the media server
This is designed for remote control scenarios where you browse media from one device (e.g., Home Assistant dashboard, phone) but want audio to play on the PC
The media player must support the Windows Media Session API for playback tracking

Media Player Compatibility

⚠️ Important Limitation: Not all media players expose their playback information to the Windows Media Session API. This means some players will open and play the file, but the Media Server UI won't show playback status, track information, or allow remote control.

✅ Compatible Players (work with playback tracking):

VLC Media Player - Full support
Groove Music (Windows 10/11 built-in) - Full support
Spotify - Full support (if already running)
Chrome/Edge/Firefox - Full support for web players
foobar2000 - Full support (with proper configuration/plugins)

❌ Limited/No Support:

Windows Media Player Classic - Opens files but doesn't expose session info
Windows Media Player (classic version) - Limited session support

Recommendation: Set VLC Media Player or Groove Music as your default audio player for the best experience with the Media Browser.

Changing Your Default Media Player (Windows)

Open Windows Settings → Apps → Default apps
Search for "Music player" or "Video player"
Select VLC Media Player or Groove Music
Files opened from Media Browser will now use the selected player

API Endpoints

The Media Browser exposes several REST API endpoints:

Endpoint	Method	Description
`/api/browser/folders`	GET	List configured media folders
`/api/browser/browse`	GET	Browse directory contents
`/api/browser/metadata`	GET	Get media file metadata
`/api/browser/thumbnail`	GET	Get thumbnail image
`/api/browser/play`	POST	Open file with default player

All endpoints require bearer token authentication.

Security Notes

Path Traversal Protection - All paths are validated to prevent directory traversal attacks
Folder Restrictions - Only configured folders are accessible
Authentication Required - All endpoints require a valid API token

Requirements

Python 3.10+
Platform-specific dependencies (see below)

Installation

Windows

pip install -r requirements.txt

Required packages: winsdk, pywin32, pycaw, comtypes

Linux

# Install system dependencies
sudo apt-get install python3-dbus python3-gi libdbus-1-dev libglib2.0-dev

pip install -r requirements.txt

macOS

pip install -r requirements.txt

No additional dependencies - uses built-in osascript.

Android (Termux)

# In Termux
pkg install python termux-api
pip install -r requirements.txt

Requires Termux and Termux:API apps from F-Droid.

Quick Start

Generate configuration with API token:

python -m media_server.main --generate-config

View your API token:

python -m media_server.main --show-token

Start the server:
```
python -m media_server.main
```
Open the Web UI (recommended):
- Navigate to http://localhost:8765/ in your browser
- Enter your API token from step 2
- Start playing media and control it from the web interface!

Or test via API:

# Health check (no auth required)
curl http://localhost:8765/api/health

# Get media status
curl -H "Authorization: Bearer YOUR_TOKEN" http://localhost:8765/api/media/status

Configuration

Configuration file locations:

Windows: %APPDATA%\media-server\config.yaml
Linux/macOS: ~/.config/media-server/config.yaml

config.yaml

host: 0.0.0.0
port: 8765

# API Tokens - Multiple tokens with labels for client identification
api_tokens:
  home_assistant: "your-home-assistant-token-here"
  mobile: "your-mobile-app-token-here"
  web_ui: "your-web-ui-token-here"

poll_interval: 1.0
log_level: INFO

Authentication

The media server supports multiple API tokens with friendly labels. This allows you to:

Issue different tokens for different clients (Home Assistant, mobile apps, web UI, etc.)
Identify which client is making requests in the server logs
Revoke individual tokens without affecting other clients

Token labels appear in all server logs, making it easy to track and debug client connections:

2026-02-06 03:36:20,806 - media_server.services.websocket_manager - [home_assistant] - INFO - WebSocket client connected
2026-02-06 03:28:24,258 - media_server.routes.scripts - [mobile] - INFO - Executing script: lock_screen

Viewing your tokens:

python -m media_server.main --show-token

Output:

Config directory: C:\Users\...\AppData\Roaming\media-server

API Tokens:
  home_assistant       B04zhGDjnxH6LIwxL3VOT0F4qORwaipD7LoDyeAG4EU
  mobile               xyz123...
  web_ui               abc456...

Environment Variables

All settings can be overridden with environment variables (prefix: MEDIA_SERVER_):

export MEDIA_SERVER_HOST=0.0.0.0
export MEDIA_SERVER_PORT=8765
export MEDIA_SERVER_LOG_LEVEL=DEBUG

Note: For multi-token configuration, use the config.yaml file. Environment variables only support single-token mode.

API Reference

Health Check

GET /api/health

No authentication required. Returns server status and platform info.

Response:

{
  "status": "healthy",
  "platform": "Windows",
  "version": "1.0.0"
}

Get Media Status

GET /api/media/status
Authorization: Bearer <token>

Response:

{
  "state": "playing",
  "title": "Song Title",
  "artist": "Artist Name",
  "album": "Album Name",
  "album_art_url": "https://...",
  "duration": 240.5,
  "position": 120.3,
  "volume": 75,
  "muted": false,
  "source": "Spotify"
}

Media Controls

All control endpoints require authentication and return {"success": true} on success.

Endpoint	Method	Body	Description
`/api/media/play`	POST	-	Resume playback
`/api/media/pause`	POST	-	Pause playback
`/api/media/stop`	POST	-	Stop playback
`/api/media/next`	POST	-	Next track
`/api/media/previous`	POST	-	Previous track
`/api/media/volume`	POST	`{"volume": 75}`	Set volume (0-100)
`/api/media/mute`	POST	-	Toggle mute
`/api/media/seek`	POST	`{"position": 60.0}`	Seek to position (seconds)
`/api/media/turn_on`	POST	-	Execute on_turn_on callback
`/api/media/turn_off`	POST	-	Execute on_turn_off callback
`/api/media/toggle`	POST	-	Execute on_toggle callback

Script Execution

The server supports executing pre-defined scripts via API and the Web UI. Scripts and callbacks can be managed directly from the Web UI — add, edit, delete, and execute with real-time output display.

List Scripts

GET /api/scripts/list
Authorization: Bearer <token>

Response:

[
  {
    "name": "lock_screen",
    "label": "Lock Screen",
    "description": "Lock the workstation",
    "timeout": 5
  }
]

Execute Script

POST /api/scripts/execute/{script_name}
Authorization: Bearer <token>
Content-Type: application/json

{"args": []}

Response:

{
  "success": true,
  "script": "lock_screen",
  "exit_code": 0,
  "stdout": "",
  "stderr": ""
}

Configuring Scripts

Add scripts in your config.yaml:

scripts:
  lock_screen:
    command: "rundll32.exe user32.dll,LockWorkStation"
    label: "Lock Screen"
    description: "Lock the workstation"
    timeout: 5
    shell: true

  shutdown:
    command: "shutdown /s /t 0"
    label: "Shutdown"
    description: "Shutdown the PC immediately"
    timeout: 10
    shell: true

  restart:
    command: "shutdown /r /t 0"
    label: "Restart"
    description: "Restart the PC"
    timeout: 10
    shell: true

  hibernate:
    command: "shutdown /h"
    label: "Hibernate"
    description: "Hibernate the PC"
    timeout: 10
    shell: true

  sleep:
    command: "rundll32.exe powrprof.dll,SetSuspendState 0,1,0"
    label: "Sleep"
    description: "Put PC to sleep"
    timeout: 10
    shell: true

Script configuration options:

Field	Required	Description
`command`	Yes	Command to execute
`label`	No	User-friendly display name (defaults to script name)
`description`	No	Description of what the script does
`icon`	No	Custom MDI icon (e.g., `mdi:power`)
`timeout`	No	Execution timeout in seconds (default: 30, max: 300)
`working_dir`	No	Working directory for the command
`shell`	No	Run in shell (default: true)

Configuring Callbacks

Callbacks are optional commands executed after media actions. Add them in your config.yaml:

callbacks:
  # Media control callbacks (run after successful action)
  on_play:
    command: "echo Play triggered"
    timeout: 10
    shell: true

  on_pause:
    command: "echo Pause triggered"
    timeout: 10
    shell: true

  on_stop:
    command: "echo Stop triggered"
    timeout: 10
    shell: true

  on_next:
    command: "echo Next track"
    timeout: 10
    shell: true

  on_previous:
    command: "echo Previous track"
    timeout: 10
    shell: true

  on_volume:
    command: "echo Volume changed"
    timeout: 10
    shell: true

  on_mute:
    command: "echo Mute toggled"
    timeout: 10
    shell: true

  on_seek:
    command: "echo Seek triggered"
    timeout: 10
    shell: true

  # Turn on/off/toggle (callback-only actions, no default behavior)
  on_turn_on:
    command: "echo PC turned on"
    timeout: 10
    shell: true

  on_turn_off:
    command: "rundll32.exe user32.dll,LockWorkStation"
    timeout: 5
    shell: true

  on_toggle:
    command: "echo Toggle triggered"
    timeout: 10
    shell: true

Available callbacks:

Callback	Triggered by	Description
`on_play`	`/api/media/play`	After play succeeds
`on_pause`	`/api/media/pause`	After pause succeeds
`on_stop`	`/api/media/stop`	After stop succeeds
`on_next`	`/api/media/next`	After next track succeeds
`on_previous`	`/api/media/previous`	After previous track succeeds
`on_volume`	`/api/media/volume`	After volume change succeeds
`on_mute`	`/api/media/mute`	After mute toggle
`on_seek`	`/api/media/seek`	After seek succeeds
`on_turn_on`	`/api/media/turn_on`	Callback-only action
`on_turn_off`	`/api/media/turn_off`	Callback-only action
`on_toggle`	`/api/media/toggle`	Callback-only action

Callback configuration options:

Field	Required	Description
`command`	Yes	Command to execute
`timeout`	No	Execution timeout in seconds (default: 30, max: 300)
`working_dir`	No	Working directory for the command
`shell`	No	Run in shell (default: true)

Running as a Service

Windows Task Scheduler (Recommended)

Run in Administrator PowerShell from the project root:

.\media_server\service\install_task_windows.ps1

To remove the scheduled task:

Unregister-ScheduledTask -TaskName "MediaServer" -Confirm:$false

Windows Service

Install:

python -m media_server.service.install_windows install

Start/Stop:

python -m media_server.service.install_windows start
python -m media_server.service.install_windows stop

Remove:

python -m media_server.service.install_windows remove

Linux (systemd)

Install:

sudo ./service/install_linux.sh install

Enable and start for your user:

sudo systemctl enable media-server@$USER
sudo systemctl start media-server@$USER

View logs:

journalctl -u media-server@$USER -f

Command Line Options

python -m media_server.main [OPTIONS]

Options:
  --host TEXT          Host to bind to (default: 0.0.0.0)
  --port INTEGER       Port to bind to (default: 8765)
  --generate-config    Generate default config file and exit
  --show-token         Show current API token and exit

Security Recommendations

Use HTTPS in production - Set up a reverse proxy (nginx, Caddy) with SSL
Strong tokens - Default tokens are 32 random characters; don't use weak tokens
Firewall - Only expose the port to trusted networks
Secrets management - Don't commit tokens to version control

Supported Media Players

Windows

Spotify
Windows Media Player
VLC
Groove Music
Web browsers (Chrome, Edge, Firefox)
Any app using Windows Media Transport Controls

Linux

Any MPRIS-compliant player:
- Spotify
- VLC
- Rhythmbox
- Clementine
- Web browsers
- MPD (with MPRIS bridge)

macOS

Spotify
Apple Music
VLC (partial)
QuickTime Player

Android (via Termux)

System media controls
Limited seek support

Troubleshooting

"No active media session"

Ensure a media player is running and has played content
On Windows, check that the app supports media transport controls
On Linux, verify MPRIS with: dbus-send --print-reply --dest=org.freedesktop.DBus /org/freedesktop/DBus org.freedesktop.DBus.ListNames | grep mpris

Permission errors on Linux

Ensure your user has access to the D-Bus session bus
For systemd service, the DBUS_SESSION_BUS_ADDRESS must be set correctly

Volume control not working

Windows: Run as administrator if needed
Linux: Ensure PulseAudio/PipeWire is running

License

MIT License

Languages

Python 46%

JavaScript 30.6%

CSS 13.3%

HTML 8.5%

Shell 0.8%

Other 0.8%