Stream, the leading provider of scalable chat, video, and feeds APIs, announced Vision Agents, the first open-source, open-platform SDK bringing real-time video and audio intelligence into developer applications.
Unlike existing frameworks that bolt video onto voice-first systems, Vision Agents were designed video-first from day one.
Marketing Technology News: MarTech Interview With Chris Golec, Founder and CEO at Channel99
“Most frameworks started with voice and later added video,” said Thierry Schellenbach, CEO and Co-Founder of Stream. “We built the opposite: a video-first foundation that’s open, extensible, and developer-friendly.”
Developers can now create AI-powered agents that see, hear, and remember in real time, enabling a new generation of interactive, multimodal applications.
Open Platform for AI Innovation
Vision Agents works with Stream Video by default but also integrates with other video SDKs and supports AI providers, including OpenAI Realtime, Google Gemini, and custom models. This flexibility lets companies adopt Vision Agents without disrupting existing infrastructure, while Stream Video and Chat users gain deep integrations for memory, messaging, and performance.
Real-Time, Video-First Intelligence
Vision Agents process live video with low latency, enabling real-time perception, scene detection, and natural audio or text responses. Core features include:
Wide-Ranging Applications
Use cases span manufacturing (defect detection), collaboration (AI note-taking, transcription), gaming (coaching, avatars), accessibility (captions, descriptions), and customer support (multimodal assistants).
Open Source and Availability
Fully open-source, Vision Agents invites community contributions to extend providers and tools.
“Vision AI today feels like ChatGPT in 2022, it’s just beginning to show what’s possible,” said Thierry Schellenbach, CEO and Co-Founder of Stream.
Marketing Technology News: Marketing at the Speed Of Behavior: Why Martech Must Catch Up With Customers?
The post Stream Launches Vision Agents First Open-Platform, Video-First SDK for Real-Time Vision AI first appeared on PressReleaseCC.
Stream Launches Vision Agents First Open-Platform, Video-First SDK for Real-Time Vision AI first appeared on Web and IT News.
TempraMed Announces Continuance into Ontario Vancouver, British Columbia–(Newsfile Corp. – April 29, 2026) – TempraMed…
NEW YORK, N.Y., April 29, 2026 (PRESSRELEASECC.COM NEWSWIRE) — The demand for smarter and more…
Tickets Now on Sale for The 58th Bell Ringer Awards Ceremony Boston, Massachusetts–(Newsfile Corp. –…
ZenaTech Files Early Warning Report Pursuant to National Instrument 61-103 Vancouver, British Columbia–(Newsfile Corp. –…
HIVE Digital Announces Closing of Private Offering of US$115 Million of 0% Exchangeable Senior Notes…
ImagineAR Inc. Voluntarily Withdraws Common Shares from OTCQB Venture Market Vancouver, British Columbia–(Newsfile Corp. –…
This website uses cookies.