This repo supports controlling the Windows desktop by voice commands. Using keyboard, mouse, and also your voice is intuitive, increases your productivity, and is really awesome.
Required is Windows 10 or 11, a headset, and the free Serenade voice recognition application.
- Download and install the free Serenade app from https://serenade.ai. During installation select no plugin and no programming language, then go to Settings > Server and choose the 'Local' endpoint for privacy and speed. Close the application when finished.
- Download and unzip this repo from https://github.com/fleschutz/talk2windows/archive/main.zip.
- Open 'Windows PowerShell (Administrator)' and execute:
Set-ExecutionPolicy RemoteSigned
to allow the execution of PowerShell scripts. - Right-click 'setup.ps1' in the talk2windows folder and select 'Execute with PowerShell'. It exports all voice commands with your personal wake word to Serenade. In the following the wake word Windows is used.
- Put your headset on and check audio and microphone.
- Launch Serenade and click the slider to switch from Paused to Listening mode (in case a 'low voice' is shown: raise the mic audio level).
- Say: "Windows, open calculator" with a calm voice into the mic (this voice command executes the PowerShell script
open-calculator.ps1
which launches the calculator app).
Most voice commands follow the generic scheme "<wake word>, <verb> <subject>" such as: "Windows, open calculator". The following voice commands are supported:
Say hello to your computer by replacing <greeting> with: good afternoon
, good evening
, good morning
, good night
, hello
, hey
, hi
, howdy
, morning
, or nice to meet you
.
Install a Microsoft Store application by replacing <app> with: 7-Zip
, CrystalDiskInfo
, CrystalDiskMark
, Discord
, Disney+
, Edge
, Firefox
, Google Chrome
, IrfanView
, Mp3tag
, Netflix
, Opera Browser
, Opera GX
, One Calendar
, Quick Share
, Rufus
, Spotify
, Sticky Notes
, TikTok
, Twitter
, Visual Studio Code
, VLC
, Windows Terminal
, or WhatsApp
. To uninstall it simply say: "Windows, uninstall <app>".
Launch the installed application by replacing <app> with: 3D-Viewer
, 7-Zip
, Calculator
, Character Map
, CrystalDiskInfo
, CrystalDiskMark
, Discord
, Disney+
, Edge
, Firefox
, Git Extensions
, Google Chrome
, Magnifier
, Microsoft Paint
, Microsoft Store
, Mp3tag
, Netflix
, Notepad
, OBS Studio
, One Calendar
, One Note
, Outlook
, Quick Share
, Remote Desktop
, Screen Clip
, Spotify
, Sticky Notes
, System Information
, Thunderbird
, Visual Studio
, VLC
, voice recorder
, Windows Terminal
, or WhatsApp
. To close it simply say: "Windows, close <app>".
Launch the Web browser (or open a new tab) with the given website. Replace <name> with: Amazon
, Apple
, Baidu
, BBC
, Bing
, BitBucket
, BitBucket status
, CDC
, CIA
, CNN
, Discord
, DistroWatch
, Docker
, Docker status
, Dropbox
, eBay
, Facebook
, FBI
, FlightRadar
, Flipboard
, FourSquare
, FRITZ!Box
, FRITZ!Repeater
, GitHub
, GliderTracker
, HolidayCheck
, HRworks
, Instagram
, IPFS
, Microsoft
, NASA
, NBC
, Netflix
, Notepad
, Outdoor Active
, PayPal
, Pinterest
, Pixabay
, Plex
, Serenade
, Slashdot
, Snap Store
, Space Weather
, SpaceX
, Spotify
, Starbucks
, Tesla
, TikTok
, Toggl
, Topo Map
, Twitter
, UFA
, Unsplash
, Walmart
, WhatsApp
, White House
, Windy
, Wikipedia
, Wolfram Alpha
, World News
, Yahoo
, YouTube
, or Zoom
. When finished say: "Windows, close tab".
Let the computer check something for you. Replace <item> with: audio
, Bitcoin rate
, CPU temperature
, dawn
, day
, DNS
, drives
, dusk
, Earth
(fun), Ether rate
, for crash dumps
, my downloads
, for rain
, for snow
, Git version
, headlines
, internet connection
, internet speed
, moon phase
, month
, New Year
, operating system
, Outlook
, ping latency
, recycle bin
, Santa
, swap space
, temperature
, Tether rate
, time zone
, up-time
, VPN
, weather
, week
, the wind
, year
, or zenith
.
Presses a keyboard shortcut to navigate in a browser window. Replace <shortcut> with: close tab
, last tab
, next page
, next tab
, previous page
, previous tab
, open new tab
, open new window
, reload page
, scroll down
, scroll to bottom
, scroll to top
, scroll up
, tab 1
, tab 2
, tab 3
, tab 4
, tab 5
, tab 6
, zoom in
, or zoom out
.
Inserts a character at the current text cursor position. Replace <char> with: at
, backslash
, comma
, copyright
, Dollar
, Euro
, hashtag
, minus
, number
, percent
, pipe
, plus
, Pound
, slash
, trademark
, underscore
, or Yen
.
Inserts text at the current text cursor position. Replace <text> with: bomb
, bye bye
, call?
, checkmark
, clapping
, date
, good-bye
, good morning
, heart
, OK
, shrugging
, smiley
, or thumbs up
.
Launches the Web browser (or opens a new tab) with the given Googleยฎ service. Replace <service> with: Alerts
, Blogger
, Books
, Calendar
, Charts
, Contacts
, Docs
, Drawings
, Earth
, Flights
, Gemini
, Keep
, Mail
, Maps
, Mars
, News
, Notes
, Photos
, Play
, Scholar
, Search
, Sheets
, Shopping list
, Sky
, Slides
, Stadia
, Tasks
, or Translate
. When finished say: "Windows, close tab".
Launches the Web browser (or opens a new tab) with the given magazine website. Replace <name> with: Cosmopolitan
, Focus
, Golf
, GQ
, Health
, National Geographic
, People
, RollingStone
, Scientific American
, Spiegel
, Sports Illustrated
, SZ
, The Oprah
, Time
, Vanity Fair
, Vogue
, or Wired
. When finished say: "Windows, close tab".
Launches the Web browser (or opens a new tab) with the given manual. Replace <name> with: Ant
, Apple
, Audacity
, Azure
, Bash
, Blender
, Chrome
, Edge
, Firefox
, GCC
, Git
, Jenkins
, MSBuild
, OBS Studio
, PowerShell
, Serenade
, SystemRescue
, Toyota
, Vim
, Voice
(yes, this page!), Volkswagen
, or Windows
. When finished say: "Windows, close tab".
Launches the Web browser (or opens a new tab) with the given city in Google Maps. Replace <name> with the city name. When finished say: "Windows, close tab".
Launches the Web browser (or opens a new tab) with the given exchange rate. Replace <named> with: Bitcoin
, Cardano
, Ether
, Euro
, NFT collections
, Solana
, Tether
, or US dollar
. When finished say: "Windows, close tab".
Launches the Web browser (or opens a new tab) and plays the given music genre. Replace <genre> with: 70s
, 80s
, 90s
, Alternative Rock
, Blues
, Chillout
, Christmas
, Classical
, Country
, Dance
, Folk
, Hip Hop
, House
, Indie
, Jazz
, Metal
, Pop
,Reggae
, Relax
, RnB
, Rock
, or Workout
. When finished say: "Windows, close tab".
Plays the given sound, just replace <name> with: alarm
, bee
, beep
, Big Ben
, car
, cat
, cow
, dog
, donkey
, elephant
, elk
, frog
, goat
, gorilla
, horse
, lion
, parrot
, pig
, rattlesnake
, vulture
, or wolf
.
Launches the Web browser (or opens a new tab) with the given browser game. Replace <game> with: 2048
, Agar
, Catan Universe
, Chess
, Cube
, ISS Docking Simulator
, Pacman
, Santa Tracker
, Slither
, Tetris
, TicTacToe
, Tower
, Vanis
, War Brokers
, or Wordle
. When finished say: "Windows, close tab".
Replaces the desktop background by a random photo from Unsplash.com. Replace <category> with: Animal
, Beach
, Car
, Christmas
, City
, Desert
, Fantasy
, Military
, Movie
, Music
, Nature
, Plane
, Random
, Space
, Sports
, or Winter
.
Set up a reminder, just replace <time> with: at # AM
, at # PM
, at midnight
, at noon
, at sunrise
, at sunset
, at tea time
, in # minutes
, or in # hours
.
Launches the File Explorer with the given drive. Replace <letter> with: C:
, D:
, E:
, F:
, G:
, H:
or M:
. When finished say: "Windows, close file explorer".
Launches the File Explorer with the given folder. Replace <named> with: apps
, applications
, autostart
, crash dumps
, desktop
, documents
, downloads
, Dropbox
, home
, music
, OneDrive
, pictures
, recycle bin
, repos
, SSH
, Talk2Windows
, temporary
, videos
, or Windows
. When finished say: "Windows, close file explorer".
Opens the given system settings, just replace <category> by: activation
, apps
, background
, backup
, bluetooth
, color
, date
, default apps
, developer
, display
, ethernet
, lockscreen
, maps
, personal
, printer
, proxy
, recovery
, speech
, start
, system
, taskbar
, themes
, time
, update
, USB
, VPN
, Wifi
, or Windows
. When finished say: "Windows, close system settings".
Shows the given table, just replace <table> with: bluetooth devices
, city weather
, earthquakes
, emojis
, environment variables
, headlines
, installed apps
, installed software
, installed voices
, network adapters
, printers
, running processes
, services
, system info
, tasks
, time zones
, or user groups
.
Performs the given action to the computer, replace <action> with: hibernate
, lock
, reboot
, shut down
, or suspend
.
Asks Windows a question, just replace <question> with: are you a computer?
, are you here?
, are you listening?
, are you ready?
, are you there?
, are you with me?
, can you talk?
, do you listen?
, how are you?
, how do you do?
, how much daylight today?
, how old are you?
, what about ...?
, what is [abbreviation]
, what's the date today?
, what's the temperature?
, what's the time?
, what time is it?
, when is Christmas?
, when is easter?
, when is midnight?
, when is noon?
, when is President's day?
, when is sunrise?
, when is sunset?
, when is tea time?
, where am I?
, where are you?
, where is the ISS
, who is your father?
, or you can talk?
.
Let Windows execute a command, replace <command> with: calculate [expression]
, connect VPN
, count down from [number]
, empty recycle bin
, enable god mode
, I need help
, locate my phone
, minimize all windows
, ping [host]
, repeat last reply
, roll a dice
, save screenshot
, set volume to [number] percent
, shut up
, sorry
, spell [text]
, tell me a joke
, tell me a quote
, thank you
, thanks
, translate [text] to [language]
, turn volume off
, turn volume on
, turn volume up
, turn volume fully up
, turn volume down
, update repository
, or wakeup [hostname]
.
Say good-bye to your computer by replacing <farewell> with: bye
, bye-bye
, ciao
, good-bye
, I'll be back
, see you
, or see you later
.
- Contributions, suggestions, and improvements are welcome!
- Open an Issue if you encounter bugs or have feature ideas.
- Create a Pull Request if you'd like to improve a script.
- Or just send your feedback to: markus.fleschutz [at] gmail.com
This open source project is licensed under the CC0 license. All trademarks are the property of their respective owners.