Video Intelligence API

Turn any video into structured intelligence

One API call. Send a video or YouTube channel — get back frames, transcripts, and timestamped citations with the exact visual frame. Built for AI agents, developers, and researchers.

"ChatTube shows you what was said.
We show you what was shown."
Our Flagship Product

Channel Intelligence

Index an entire YouTube channel. Ask any question across hundreds of videos. Get exact quotes, timestamps, and the visual frame of what was on screen.

1. Paste a channel & ask anything
Channel Intelligence
Channel: @TechStartupGuru (147 videos indexed)
2. We search every video instantly
API Request
POST /search

{
  "channel_id": "a8f3b2c1-...",
  "keywords": ["API key", "authentication", "security", "token"]
}
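
The request above could be assembled in Python roughly as follows. This is a minimal sketch, not the official client: the host `https://api.example.com` is a placeholder (the page shows only the `/search` path), and sending the same `Authorization: Bearer` header used by the `/extract` example is an assumption. The truncated `channel_id` and key are kept exactly as shown on the page.

```python
import json
import urllib.request

BASE_URL = "https://api.example.com"  # placeholder: the real host is not given on this page


def build_search_request(channel_id, keywords, api_key):
    """Build (but do not send) a POST /search request with a JSON body."""
    body = json.dumps({"channel_id": channel_id, "keywords": list(keywords)}).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/search",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Assumption: /search accepts the same bearer token as /extract.
            "Authorization": f"Bearer {api_key}",
        },
    )


req = build_search_request(
    "a8f3b2c1-...",
    ["API key", "authentication", "security", "token"],
    "vapi_...",
)
# To actually send it: urllib.request.urlopen(req), then json.load() the response.
```

Building the request separately from sending it makes the payload easy to inspect or log before any network call is made.
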
3. Get exact quotes + visual proof
98.2% match Ep. 34 — "Securing Your API" @ 4:22

"Never store API keys in your frontend code. Always use environment variables and rotate your keys every 90 days. We learned this the hard way when our staging keys got leaked on GitHub..."

Visual Frame @ 4:22
Shows presenter pointing at code editor with .env file open
91.7% match Ep. 12 — "Auth Best Practices" @ 11:05

"Use JWT tokens with short expiry times. Never trust the client. Always validate on the server side..."
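
Each result above cites a moment such as "4:22" or "11:05". One way to turn such a citation into a clickable link is sketched below. It assumes timestamps arrive as "M:SS" or "H:MM:SS" strings, as displayed on this page (the API may return numeric seconds instead), and `VIDEO_ID` is a hypothetical placeholder for the YouTube video identifier.

```python
def timestamp_to_seconds(stamp: str) -> int:
    """Convert "M:SS" or "H:MM:SS" into a whole number of seconds."""
    seconds = 0
    for part in stamp.split(":"):
        seconds = seconds * 60 + int(part)
    return seconds


def youtube_link(video_id: str, stamp: str) -> str:
    """Build a YouTube watch URL that starts playback at the cited moment."""
    return f"https://www.youtube.com/watch?v={video_id}&t={timestamp_to_seconds(stamp)}s"


print(timestamp_to_seconds("4:22"))       # 262
print(youtube_link("VIDEO_ID", "11:05"))  # https://www.youtube.com/watch?v=VIDEO_ID&t=665s
```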

Whole Channel Indexing

Give us a channel URL. We index every video — transcripts, chunks, embeddings. Search them all instantly.

Visual Frame Citations

When a creator says "as you can see here" — we show you the actual frame. Nobody else does this.

Video API

Video Processing API

Upload any video file. Get back extracted frames + transcript in clean JSON. One endpoint. No complex setup. No AWS account needed.

Upload a video
Terminal
$ curl -X POST /extract \
  -H "Authorization: Bearer vapi_..." \
  -F "file=@meeting_recording.mp4"
Get structured JSON back
Response
{
  "job_id": "d9d7f67f-...",
  "total_frames": 70,
  "frames": [
    { "timestamp": 0, "file": "frame_0001.jpg" },
    { "timestamp": 10, "file": "frame_0002.jpg" },
    // ... 68 more frames
  ]
}
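
A response shaped like the one above could be consumed as in the following sketch. The sample string is a trimmed copy of that response with the `// ...` comment removed, since comments are not valid JSON. Treating `timestamp` as seconds from the start of the video is an assumption, and `Frame` and `frame_at` are illustrative names, not part of the API.

```python
import json
from dataclasses import dataclass


@dataclass
class Frame:
    timestamp: int  # assumed to be seconds from the start of the video
    file: str


SAMPLE = """{
  "job_id": "d9d7f67f-...",
  "total_frames": 70,
  "frames": [
    { "timestamp": 0, "file": "frame_0001.jpg" },
    { "timestamp": 10, "file": "frame_0002.jpg" }
  ]
}"""


def parse_frames(raw: str) -> list[Frame]:
    """Turn the JSON response into a list of Frame records."""
    payload = json.loads(raw)
    return [Frame(timestamp=f["timestamp"], file=f["file"]) for f in payload["frames"]]


def frame_at(frames: list[Frame], second: int):
    """Return the latest frame at or before `second`, or None if there is none."""
    candidates = [f for f in frames if f.timestamp <= second]
    return max(candidates, key=lambda f: f.timestamp) if candidates else None


frames = parse_frames(SAMPLE)
print(frame_at(frames, 14).file)  # frame_0002.jpg
```

A lookup like `frame_at` is one way to pair a transcript timestamp with the nearest extracted frame.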

Frame Extraction

1 frame per 10 seconds, 1280px wide, JPEG. Returned with timestamps.
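
As a rough guide to how many frames a video will yield at this rate, the sketch below lists the expected timestamps. It assumes the first frame is taken at 0 seconds and one more every 10 seconds strictly before the end of the video, which is consistent with the sample response (timestamps 0, 10, ...) but is not confirmed by the page.

```python
import math

FRAME_INTERVAL_S = 10  # from the page: 1 frame per 10 seconds


def expected_timestamps(duration_s: float) -> list[int]:
    """Timestamps (in seconds) at which a frame would be extracted."""
    # Assumption: sampling starts at t=0 and stops before the video ends.
    return list(range(0, math.ceil(duration_s), FRAME_INTERVAL_S))


stamps = expected_timestamps(700)  # an 11 min 40 s video
print(len(stamps), stamps[:3], stamps[-1])  # 70 [0, 10, 20] 690
```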

Transcription (BYOK)

Bring your own OpenAI key. We call Whisper, you pay OpenAI directly. $0.006/min.
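
To estimate what OpenAI would bill for a given video at the quoted rate, a simple calculation like the one below is enough. It is a sketch only: it applies the $0.006/min figure from this page linearly to the duration and ignores any rounding OpenAI applies on its side.

```python
WHISPER_USD_PER_MIN = 0.006  # rate quoted on this page, billed by OpenAI


def transcription_cost_usd(duration_s: float) -> float:
    """Approximate OpenAI transcription charge for a video of the given length."""
    return round(duration_s / 60 * WHISPER_USD_PER_MIN, 4)


print(transcription_cost_usd(3600))  # 0.36 for a one-hour video
```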

Use Cases

Built for people who need proof

Learn Any Strategy

"What entry criteria does he use?" — search a trading educator's entire channel and get their exact strategy quoted with timestamps.

Research & Journalism

"Did they ever mention funding sources?" — search hundreds of videos instantly. Get the exact quote, video, and timestamp.

AI Agent Pipelines

Feed video intelligence into AI workflows. Structured JSON designed for agents — frames, transcripts, citations in one call.

Due Diligence

"What claims did this influencer make about returns?" — index their channel, search everything, see the frame where they said it.

Course Summarisation

Index an entire online course. Ask "how do I set up X?" and get the answer quoted from the exact lesson with a timestamp link.

Content Moderation

Scan videos for specific claims, keywords, or visual content. Automated compliance checking across entire channels.

Pricing

Pay for what you use

Free
$0
10 videos free per month
  • Frame extraction
  • 10 videos/month
  • JSON response
Get Started
Extract
$0.10
per video
  • Frame extraction
  • Unlimited videos
  • JSON + frame URLs
Get Started
Transcribe
$0.25
per video (BYOK)
  • Frames + transcript
  • Unlimited videos
  • Bring your own OpenAI key
Get Started
Get Started

Start extracting intelligence in 30 seconds

The free tier includes 10 videos/month, with no credit card required. You are one API call away from understanding any video.