Sample Rate
AudioHow many times per second an audio waveform is measured, expressed in kHz.
Standard sample rates are 44.1 kHz (CD quality), 48 kHz (broadcast/video standard), and 16 kHz (speech-optimized). Whisper internally downsamples to 16 kHz regardless of input — so a 48 kHz studio recording transcribes identically to a 16 kHz phone recording in terms of model performance, provided the audio is otherwise clear.
Schema Markup (JSON-LD)
SEOStructured data embedded in a webpage that tells search engines and LLMs exactly what the page is about.
Schema.org markup in JSON-LD format powers Google's rich results (FAQ snippets, How-To carousels, breadcrumbs, video carousels). For sermon pages, the most valuable schemas are Article, FAQPage, HowTo, BreadcrumbList, VideoObject, and DefinedTerm. Schema is what makes content show up in AI Overviews — it's the single highest-leverage technical SEO investment.
Scripture Reference
Ministry OutputA citation of a Bible passage in 'Book Chapter:Verse' form (e.g., John 3:16, Romans 8:28–30).
Scripture references are the highest-value structured data in any sermon. Modern AI can extract them automatically, linking each citation to a Bible API for inline display. A transcript with linked scripture references serves Bible-app users, drives SEO via long-tail Bible queries, and dramatically improves time-on-page.
Sermon Archive
DistributionA searchable, organized collection of a church's sermons spanning months or years.
Transcribed sermon archives convert a church website from a brochure into a research library. Members find past messages; first-time visitors search by topic, scripture, or pastor; Google indexes everything as long-tail organic traffic. Major archives like Grace To You (John MacArthur), Desiring God (John Piper), and Spurgeon Gems demonstrate the SEO power of decades of indexed text.
Sermon Clips
Content StrategyAlso called: Sermon Shorts, Sermon Reels
30–90 second vertical-video excerpts pulled from a full sermon for Reels, TikTok, and Shorts.
Sermon clips are the social-media equivalent of a movie trailer. The best clips share one self-contained idea, open with a hook in the first 3 seconds, and end with a question or call to read more. Transcripts make finding clip-worthy moments dramatically faster — Cmd-F through 5,000 words beats scrubbing 45 minutes of video.
Sermon Transcription
CoreThe process of converting recorded sermon audio (or video) into written, formatted text.
Sermon transcription is the conversion of preached audio into searchable, readable, indexable text. It serves accessibility (deaf and hard-of-hearing members), discoverability (Google can't index audio), content repurposing (blogs, social, email), and member engagement (note-taking, study, sharing). In 2026, AI transcription has made the process near-instant ($0.27 per sermon, 5 minutes turnaround).
Speaker ID
TechnologyIdentifying which named individual (not just 'Speaker 1') is speaking in a recording.
Speaker ID extends diarization by mapping anonymous speaker clusters to known voices via voice fingerprinting. Set it up by providing a few labeled samples ('this is Pastor John,' 'this is Worship Leader Sarah'); the model then labels every future recording automatically. ElevenLabs and AssemblyAI support speaker ID; OpenAI Whisper does not.
SRT (SubRip Subtitle File)
File FormatThe most common subtitle file format, a plain text file with timecodes and dialogue.
SRT files are tiny plain-text files containing numbered subtitle blocks, each with a start/end timecode and the dialogue text. YouTube, Vimeo, Facebook, and most video players accept SRT directly. Every sermon transcription service worth using exports SRT alongside plain text.
Subtitles
CaptionsOn-screen text translating dialogue, intended for viewers who can hear but don't speak the language.
Subtitles assume hearing — they don't include speaker labels or sound effects like '[congregation singing]'. They're typically used for translation (Spanish subtitles on an English sermon, for instance). Captions (the broader term) are for accessibility; subtitles are for translation. Modern AI services produce both from one source transcript.