pro guide
everything you can do with shrp — from free voice typing to pro file transcription, youtube transcripts, on-demand suggested transcript actions, text-to-speech, smart voice tools, api keys, mcp, and cloud sync.
// quick start
file upload (paid plans)
- 1. go to your dashboard
- 2. drag and drop your audio/video file
- 3. select language or auto-detect
- 4. wait for processing (1-5 minutes)
- 5. download as txt, docx, srt, vtt, or json
live voice typing (free)
- 1. go to speech to text
- 2. choose your language (55+ supported)
- 3. allow microphone permissions
- 4. start speaking — text appears live
- 5. copy, download, or save to projects
// tools
speech to text
Live voice transcription in your browser using Web Speech API. 55+ languages, auto-punctuation, sentence case. Free forever, no account needed.
file upload transcription
Upload audio/video files (MP3, WAV, MP4, MOV, M4A, AAC) for transcription powered by AssemblyAI. Cloud includes 60 min/mo and 100MB uploads. Starter and Pro add speaker diarization, timestamps, word-level confidence, and AI extraction.
youtube to text
Paste a YouTube link and retrieve the available transcript as readable text. Generate suggested actions on demand, ask custom transcript prompts, save outputs when signed in, and repurpose video content.
text to speech
Convert text to natural-sounding audio using Google Cloud voices. Choose from 2,000+ voices across 60+ languages, adjust speed, preview voices, and download MP3 audio.
smart voice tools
Turn speech or pasted transcript text into meeting notes, action items, email drafts, reports, summaries, invoice details, and other structured outputs. YouTube transcript actions use the smart transcript action limits.
invoice generator
Speak or type invoice details, let SHRP structure them, then review and print or save the invoice as PDF.
pdf to audio estimator
Upload a text PDF to count pages, words, characters, and estimated listening time. Full PDF-to-audio generation is in early access, so v1 does not generate audio.
rest api
Use SHRP from scripts and automations with API keys. Live endpoints include speech-to-text, text-to-speech, and YouTube transcript extraction.
mcp server
Connect SHRP to Claude Desktop, Cursor, Windsurf, or compatible AI clients. Current MCP tools include YouTube transcript extraction and text-to-speech.
// pro features in detail
confidence heatmap
Every word in your transcript is color-coded by confidence score. Green = high confidence, yellow = medium, red = low. Instantly see which words to verify — no need to re-listen to the entire file.
speaker diarization
Automatically identifies and labels different speakers in your audio. Perfect for interviews, meetings, and multi-person recordings. Each speaker gets a label (Speaker A, Speaker B, etc.) with their dialogue grouped together.
concurrent transcriptions
Starter: 1 file at a time. Pro: up to 3 files processing simultaneously. Upload a batch and let them all transcribe in parallel.
batch upload
Drag and drop multiple audio files at once. They queue up and process one after another (or concurrently on Pro). Track progress for each file individually.
auto-save to file system
Uses the File System Access API to auto-save your transcript to a real file on your device as you speak. Survives browser crashes. No cloud needed.
project organization
Save transcripts into folders with custom tags. Free accounts get 25 cloud projects; Cloud plan and above sync unlimited projects across devices.
// plans
free — $0 — live voice typing, 55+ languages, save 25 cloud projects, 10K tts chars plus 5K signup bonus
cloud — $3/mo — unlimited cloud project sync, 60 min/mo file transcription, 100MB uploads
starter — $7/mo — 200 min/mo transcription, 100MB uploads, speaker labels, ai extraction, 200K tts/mo
pro — $15/mo — 1500 min/mo, 500MB uploads, 3 concurrent, speaker labels, ai extraction, 500K tts/mo, api access, priority support
// dashboard features
usage tracking
See your monthly transcription minutes, file uploads, TTS character usage, and credit balance at a glance.
transcription history
Browse all past transcriptions. Starter and Pro can view word-level timestamps, speaker labels, confidence highlighting, and advanced exports.
my projects
Organize saved transcripts into folders with tags. Cloud plan and above syncs across devices.
account settings
Manage your subscription, update password, export all data, or delete your account. Billing handled via Stripe customer portal.
api keys
Create and revoke SHRP API keys from the dashboard. Keys are shown once, stored as hashes, and use your existing plan limits.
// api and mcp
REST API — call SHRP from scripts, backend jobs, internal tools, and automations. API usage follows your existing SHRP plan and credits.
MCP server — expose selected SHRP tools inside compatible AI clients. This is mainly for developers and agent workflows; most browser users do not need it.
// export formats
txt — plain text transcript
docx — formatted word document
srt — subtitle file with timestamps
vtt — web subtitle format
json — full data with segments, speakers, confidence
zip — all formats bundled together
// tips for better accuracy
live voice typing
- - use a quiet environment
- - speak clearly at moderate pace
- - use a good quality microphone
- - select the correct language
- - enable auto-punctuation
file uploads
- - supported: mp3, wav, mp4, mov, m4a, aac
- - max size: cloud 100MB, starter 100MB, pro 500MB
- - processing: ~1-5 minutes typical
- - better audio quality = better results
- - batch upload multiple files at once