pro guide

everything you can do with shrp — from free voice typing to pro file transcription, youtube transcripts, on-demand suggested transcript actions, text-to-speech, smart voice tools, api keys, mcp, and cloud sync.

// quick start

file upload (paid plans)

  1. 1. go to your dashboard
  2. 2. drag and drop your audio/video file
  3. 3. select language or auto-detect
  4. 4. wait for processing (1-5 minutes)
  5. 5. download as txt, docx, srt, vtt, or json

live voice typing (free)

  1. 1. go to speech to text
  2. 2. choose your language (55+ supported)
  3. 3. allow microphone permissions
  4. 4. start speaking — text appears live
  5. 5. copy, download, or save to projects

// tools

🎤

speech to text

Live voice transcription in your browser using Web Speech API. 55+ languages, auto-punctuation, sentence case. Free forever, no account needed.

📁

file upload transcription

Upload audio/video files (MP3, WAV, MP4, MOV, M4A, AAC) for transcription powered by AssemblyAI. Cloud includes 60 min/mo and 100MB uploads. Starter and Pro add speaker diarization, timestamps, word-level confidence, and AI extraction.

▶️

youtube to text

Paste a YouTube link and retrieve the available transcript as readable text. Generate suggested actions on demand, ask custom transcript prompts, save outputs when signed in, and repurpose video content.

🔊

text to speech

Convert text to natural-sounding audio using Google Cloud voices. Choose from 2,000+ voices across 60+ languages, adjust speed, preview voices, and download MP3 audio.

🧠

smart voice tools

Turn speech or pasted transcript text into meeting notes, action items, email drafts, reports, summaries, invoice details, and other structured outputs. YouTube transcript actions use the smart transcript action limits.

🧾

invoice generator

Speak or type invoice details, let SHRP structure them, then review and print or save the invoice as PDF.

📚

pdf to audio estimator

Upload a text PDF to count pages, words, characters, and estimated listening time. Full PDF-to-audio generation is in early access, so v1 does not generate audio.

🔑

rest api

Use SHRP from scripts and automations with API keys. Live endpoints include speech-to-text, text-to-speech, and YouTube transcript extraction.

🤖

mcp server

Connect SHRP to Claude Desktop, Cursor, Windsurf, or compatible AI clients. Current MCP tools include YouTube transcript extraction and text-to-speech.

// pro features in detail

🎯

confidence heatmap

Every word in your transcript is color-coded by confidence score. Green = high confidence, yellow = medium, red = low. Instantly see which words to verify — no need to re-listen to the entire file.

👥

speaker diarization

Automatically identifies and labels different speakers in your audio. Perfect for interviews, meetings, and multi-person recordings. Each speaker gets a label (Speaker A, Speaker B, etc.) with their dialogue grouped together.

concurrent transcriptions

Starter: 1 file at a time. Pro: up to 3 files processing simultaneously. Upload a batch and let them all transcribe in parallel.

📦

batch upload

Drag and drop multiple audio files at once. They queue up and process one after another (or concurrently on Pro). Track progress for each file individually.

💾

auto-save to file system

Uses the File System Access API to auto-save your transcript to a real file on your device as you speak. Survives browser crashes. No cloud needed.

📂

project organization

Save transcripts into folders with custom tags. Free accounts get 25 cloud projects; Cloud plan and above sync unlimited projects across devices.

// plans

free — $0 — live voice typing, 55+ languages, save 25 cloud projects, 10K tts chars plus 5K signup bonus

cloud — $3/mo — unlimited cloud project sync, 60 min/mo file transcription, 100MB uploads

starter — $7/mo — 200 min/mo transcription, 100MB uploads, speaker labels, ai extraction, 200K tts/mo

pro — $15/mo — 1500 min/mo, 500MB uploads, 3 concurrent, speaker labels, ai extraction, 500K tts/mo, api access, priority support

view full pricing →

// dashboard features

📊

usage tracking

See your monthly transcription minutes, file uploads, TTS character usage, and credit balance at a glance.

📋

transcription history

Browse all past transcriptions. Starter and Pro can view word-level timestamps, speaker labels, confidence highlighting, and advanced exports.

📁

my projects

Organize saved transcripts into folders with tags. Cloud plan and above syncs across devices.

⚙️

account settings

Manage your subscription, update password, export all data, or delete your account. Billing handled via Stripe customer portal.

🔑

api keys

Create and revoke SHRP API keys from the dashboard. Keys are shown once, stored as hashes, and use your existing plan limits.

// api and mcp

REST API — call SHRP from scripts, backend jobs, internal tools, and automations. API usage follows your existing SHRP plan and credits.

MCP server — expose selected SHRP tools inside compatible AI clients. This is mainly for developers and agent workflows; most browser users do not need it.

api docs →mcp server →

// export formats

txt — plain text transcript

docx — formatted word document

srt — subtitle file with timestamps

vtt — web subtitle format

json — full data with segments, speakers, confidence

zip — all formats bundled together

// tips for better accuracy

live voice typing

  • - use a quiet environment
  • - speak clearly at moderate pace
  • - use a good quality microphone
  • - select the correct language
  • - enable auto-punctuation

file uploads

  • - supported: mp3, wav, mp4, mov, m4a, aac
  • - max size: cloud 100MB, starter 100MB, pro 500MB
  • - processing: ~1-5 minutes typical
  • - better audio quality = better results
  • - batch upload multiple files at once

// need help?

contact supportfaqgo to dashboardaccount settings