Trending...
- Rep. Gina H. Curry and Dr. Conan Tu Inspire at Kopp Foundation for Diabetes Hybrid Fundraising Gala and National Leadership Forum
- Paxaterra Global Expands Its Mission to Lead with Soul
- Lt. Governor Primavera Celebrates Taiwan's 114th National Day and Colorado Partnership
First Open-Platform, Video-First SDK for Real-Time Vision AI
BOULDER, Colo. - ColoradoDesk -- Stream, the leading provider of scalable chat, video, and feeds APIs, today announced Vision Agents, the first open-source, open-platform SDK bringing real-time video and audio intelligence into developer applications.
Unlike existing frameworks that bolt video onto voice-first systems, Vision Agents were designed video-first from day one.
"Most frameworks started with voice and later added video," said Thierry Schellenbach, CEO and Co-Founder of Stream. "We built the opposite: a video-first foundation that's open, extensible, and developer-friendly."
Developers can now create AI-powered agents that see, hear, and remember in real time, enabling a new generation of interactive, multimodal applications.
Open Platform for AI Innovation
Vision Agents works with Stream Video by default but also integrates with other video SDKs and supports AI providers, including OpenAI Realtime, Google Gemini, and custom models. This flexibility lets companies adopt Vision Agents without disrupting existing infrastructure, while Stream Video and Chat users gain deep integrations for memory, messaging, and performance.
More on Colorado Desk
Real-Time, Video-First Intelligence
Vision Agents process live video with low latency, enabling real-time perception, scene detection, and natural audio or text responses. Core features include:
Wide-Ranging Applications
Use cases span manufacturing (defect detection), collaboration (AI note-taking, transcription), gaming (coaching, avatars), accessibility (captions, descriptions), and customer support (multimodal assistants).
Open Source and Availability
Fully open-source, Vision Agents invites community contributions to extend providers and tools.
"Vision AI today feels like ChatGPT in 2022, it's just beginning to show what's possible," said Thierry Schellenbach, CEO and Co-Founder of Stream.
Developers and partners can contribute new processors, adapters, and integrations directly on GitHub: https://github.com/GetStream/Vision-Agents
Unlike existing frameworks that bolt video onto voice-first systems, Vision Agents were designed video-first from day one.
"Most frameworks started with voice and later added video," said Thierry Schellenbach, CEO and Co-Founder of Stream. "We built the opposite: a video-first foundation that's open, extensible, and developer-friendly."
Developers can now create AI-powered agents that see, hear, and remember in real time, enabling a new generation of interactive, multimodal applications.
Open Platform for AI Innovation
Vision Agents works with Stream Video by default but also integrates with other video SDKs and supports AI providers, including OpenAI Realtime, Google Gemini, and custom models. This flexibility lets companies adopt Vision Agents without disrupting existing infrastructure, while Stream Video and Chat users gain deep integrations for memory, messaging, and performance.
More on Colorado Desk
- Milwaukee Job Corps Center: Essential Workforce Training—Admissions Now Open
- Aissist.io Launches Hybrid AI Workforce to Solve AI Pilot Failure for Customer Support Automation
- Christy Sports Makes Snowsports More Accessible for Families to Get Outside Together
- MainConcept Completes Management Buyout to Become Independent Company
- LIB Industry Expands Full-Series Salt Spray Corrosion Test Chambers to Meet Global Testing Standards
Real-Time, Video-First Intelligence
Vision Agents process live video with low latency, enabling real-time perception, scene detection, and natural audio or text responses. Core features include:
- Video-first intelligence for scene understanding.
- Real-time audio with transcription, speech, and voice activity detection.
- Memory and context to recall details naturally.
- Action-ready design to connect with external APIs and services.
Wide-Ranging Applications
Use cases span manufacturing (defect detection), collaboration (AI note-taking, transcription), gaming (coaching, avatars), accessibility (captions, descriptions), and customer support (multimodal assistants).
Open Source and Availability
Fully open-source, Vision Agents invites community contributions to extend providers and tools.
"Vision AI today feels like ChatGPT in 2022, it's just beginning to show what's possible," said Thierry Schellenbach, CEO and Co-Founder of Stream.
Developers and partners can contribute new processors, adapters, and integrations directly on GitHub: https://github.com/GetStream/Vision-Agents
Source: GetStream.io
0 Comments
Latest on Colorado Desk
- Colorado: Lt. Governor Primavera Speaks at Greeley Chapter Symposium of the Federation of the Blind
- MetroWest wellness: Holliston farmhouse spa unveils Centerpoint Studio
- Colorado Filmmakers' Short Doc GRINTA! Selected for Prestigious BANFF Mountain Film Festiva
- Cancer Survivor Roslyn Franken Marks 30-Year Milestone with Empowering Gift for Women
- Colorado Springs: Podcast: 'Tis the season for the Pikes Peak Highway
- Featured Course - Photographic Evidence in Discovery
- Are Seasonal Workers in Colorado Covered by Workers' Comp? Yes—But Know Your Rights
- ENERGY33 Successfully Completes Second Engineering & Construction Management Contract for a 27MW STX Cogeneration Power Plant in Honduras
- Florida International University: "Psychiatry: An Industry of Death" Traveling Exhibit Educates Students on Mental Health Abuse
- CCHR: VA's Psychiatric Treatments Betray Veterans, Fuel Suicide and Death
- Integris Composites Named Armor Partner for U.S. Army's XM30 Combat Vehicle
- Governor Polis: Colorado Welcomes the Long-Awaited Return of the Hostages to their Homes and Families, and Cease-fire that Will Hopefully Build Towards Lasting Peace in the Middle East
- Jaipur Countryside, 4-Star Comfort: $199 for Two— All-Inclusive with Meals + Transfers at Heritage Hotel Savista
- Probate Shepherd® Announces a New Member Probate Attorney in Fort Worth, TX
- Phinge Announces "Test the Waters" Campaign for Potential Regulation A+ Offering: Home of Netverse Verified AI & Patented App-less Technology Platform
- Governor Polis Verbally Declares Disaster Emergency for Flooding in Western Colorado
- RJ Grimshaw Launches "The AI EDGE" A Practical Guide Where Leadership Meets Innovation
- Probate Shepherd® Announces a New Member Probate Attorney in Sugar Land, TX
- Live Good Leads with Love: Creating Opportunity, Protecting the Vulnerable and Inspiring Hope
- Probate Shepherd® Announces a New Member Probate Attorney in The Woodlands, TX