Tüm roadmap'e dön
topicadvanced★ Pro
Video Anlama
Gemini native video + Twelve Labs — frame extraction + temporal QA + scene segmentation.
3 saat2 kaynak1 önkoşul
Gemini 2.x native video input — 1 saatlik video direkt context'e. Saniye-seviye soru sor: "10:32'de ne oluyor?"
Alternatif: Frame extraction (1fps ya da scene-change) → vision LLM'e batch. Daha pahalı, daha esnek.
Twelve Labs: dedicated video AI — semantic search, scene segmentation, key moment detection.