Vision-Language Model (VLM) for Semantic Video Detection and Search
What if your video management system could instantly recognize complex scenes just by describing them in plain English? With Axxon One Vision-Language Model (VLM), users can set up real-time detection of objects and events using intuitive natural-language queries.
VLM in Axxon One also supports retrospective search. When this feature is enabled, the system stores frame descriptions generated by the neural network during video recording and makes them available for later search. This allows users to work with live detection and archive search within one AI-powered workflow.
Real-Time Semantic Detection with VLM
Simply describe what you want to detect — whether it’s “a running human” or “a woman wearing a black dress.” The AI-powered Meta-Detector VLM processes the query and identifies matching scenes as they appear in the video feed, offering seamless and precise scene detection. In search mode, the user enters a text query, and the system translates it into the metadata representation used by the neural network to find the closest matches.
VLM Bridges Human Intent and Machine Execution
Instead of configuring complex rules, you simply describe what you need, and the system does the rest. This makes video analytics more accessible, more flexible, and easier to adapt to changing operational needs.
In live mode, Meta-Detector VLM works as an online tool: the user enters a text query and receives an alarm when that query matches the frame description in real time.
For retrospective search, users can enable metadata storage on the Meta-Detector side. This makes the frame descriptions available for archive search later, without requiring any detectors to be configured in advance. Search is performed across all recorded metadata generated by the neural network during video recording.
Examples of Detection
- Detect “a package left at the entrance”.
- Identify “a person climbing a fence near the loading dock”.
- Recognize “a white car at a crosswalk from a top view”.
- Search archived footage for “person wearing pink”.
Meta-Search in Recorded Video
Once metadata storage is enabled, users can open the search tab, select Meta-Search VLM, and work with it in the same intuitive way as in detector settings. They enter a text query, and the system returns frames and video fragments whose stored metadata best matches that request.
Users can also set similarity in percent to refine search results. This makes it possible to broaden or narrow the match depending on how specific the query is and how many relevant results are needed.
Meta-search does not generate events by itself. Its role is to use previously stored metadata for retrospective retrieval. If dedicated detectors are already configured and generating events in real time, then search can be performed through those events instead.
The Value of VLM in Axxon One VMS
AI-driven Vision-Language Model capabilities transform traditional surveillance systems into more intelligent, more flexible video management platforms. By combining Meta-Detector VLM for live semantic detection with Meta-Search VLM for archive retrieval, Axxon One enables:
- Real-time scene understanding for live video streams.
- Faster response times to security threats.
- Improved accuracy in identifying complex scenes.
- Less time spent on manual review and configuration.
By allowing users to interact with both live and recorded video through human-readable descriptions, Axxon One VLM provides an effective approach to real-time scene understanding and AI-powered archive search. This extends video analytics from event detection toward semantic video understanding inside a full VMS workflow.
Explorez les spécifications techniques complètes du logiciel Axxon One VMS, incluant les fonctions principales, les fonctionnalités spéciales, la vidéo analytique, ainsi que les appareils et standards pris en charge.
Téléchargez la version PDF de la présentation complète d’Axxon One, contenant des informations détaillées sur notre logiciel avancé de gestion vidéo.
Accédez à des informations générales sur les fonctionnalités et technologies d’Axxon One VMS dans un format pratique.