Changelog January 2025: New Functions - Visual Understanding & Threat Detection

Following a significant update to the Deep Live Hub of our DeepVA composite AI platform in December 2024, we are pleased to kick off the year with two brand-new AI services for the Deep Media Analyzer.

New service: Visual Understanding

The Visual Understanding module is a new addition to Aiconix’s Deep Media Analyzer that enhances the analysis of images and videos through prompt-based visual language understanding. Aimed at users who require a deeper understanding of their visual content, it complements our already powerful zero-shot object and scene recognition by letting users tailor the analysis through prompts.

Perform vision-language understanding tasks such as visual question answering, scene comprehension and advanced reasoning. Try it now!

Key Features

Visual Language Comprehension: Interprets visual elements such as symbols, images, and layouts to provide insights without relying on audio information.

Prompt-Based Analysis: Users input specific prompts to guide the analysis, enabling tasks like scene description, content summarization, emotion and tone analysis, highlights extraction and audience engagement insights.

Because the analysis is driven by a prompt, many different use cases are possible. Here are some example prompts:

  • Scene Description

    “Describe the actions happening in this video scene.”

  • Content Summarization

    “Summarize the key events in this video.”

  • Emotion and Tone Analysis

    “What is the emotional tone of this scene?”

  • Highlights Extraction

    “Find emotional key scenes in this video.”

  • Pattern Recognition

    “What visual elements are most frequently focused on?”

The results are returned as text together with the initial prompt. This module is available to all DeepVA users, and we will be adding further features and parameters in upcoming releases.
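As a rough illustration of the prompt-based workflow, the sketch below pairs a media source with one of the example prompts above and mirrors the described output shape (text returned together with the initial prompt). The field names `source`, `module`, `prompt` and `text`, the module identifier and the helper functions are hypothetical and are not taken from the DeepVA API; please consult the API documentation for the actual request format.

```python
import json

# Example prompts from this changelog, keyed by use case.
EXAMPLE_PROMPTS = {
    "scene_description": "Describe the actions happening in this video scene.",
    "content_summarization": "Summarize the key events in this video.",
    "emotion_and_tone": "What is the emotional tone of this scene?",
    "highlights_extraction": "Find emotional key scenes in this video.",
    "pattern_recognition": "What visual elements are most frequently focused on?",
}


def build_request(source_url: str, use_case: str) -> dict:
    """Build a hypothetical job payload (all field names are assumptions)."""
    if use_case not in EXAMPLE_PROMPTS:
        raise ValueError(f"unknown use case: {use_case}")
    return {
        "source": source_url,
        "module": "visual_understanding",  # hypothetical identifier
        "prompt": EXAMPLE_PROMPTS[use_case],
    }


def pair_result(request: dict, answer_text: str) -> dict:
    """Mirror the described output: the text result plus the initial prompt."""
    return {"prompt": request["prompt"], "text": answer_text}


if __name__ == "__main__":
    request = build_request("https://example.com/clip.mp4", "content_summarization")
    print(json.dumps(request, indent=2))
```

Keeping the prompts in one dictionary makes it easy to run the same media file through several use cases and compare the returned texts side by side.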

New feature: Threat Detection

Designed primarily for security applications such as monitoring of large sites or buildings, this module can be integrated into workflows via an API for automated threat detection and response. It is a visual language understanding model, pre-trained to recognize dangerous situations such as violence, medical issues or hazards such as fire, to provide faster alerts to CCTV operators.

The results are returned as a clear yes-or-no indicator, along with a written explanation of why the system reached that judgement. This allows a human operator to check the video resource more quickly and take action or call for further assistance if required. This module is available to all DeepVA users and is ideally used via the API.
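To show how such a result might feed an operator workflow, here is a minimal sketch that parses a yes-or-no indicator with its explanation and produces an alert message only when a threat is reported. The result keys `indicator` and `explanation`, the `ThreatResult` class and the `camera_id` label are assumptions made for this example, not the documented DeepVA response format.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ThreatResult:
    threat_detected: bool
    explanation: str


def parse_result(raw: dict) -> ThreatResult:
    """Parse a hypothetical result dict with a yes/no indicator."""
    indicator = str(raw["indicator"]).strip().lower()
    if indicator not in ("yes", "no"):
        raise ValueError(f"unexpected indicator: {raw['indicator']!r}")
    return ThreatResult(indicator == "yes", str(raw.get("explanation", "")))


def operator_alert(camera_id: str, result: ThreatResult) -> Optional[str]:
    """Return an alert line for the operator, or None if nothing was detected."""
    if not result.threat_detected:
        return None
    return f"[ALERT] {camera_id}: {result.explanation}"


if __name__ == "__main__":
    sample = {"indicator": "Yes", "explanation": "Visible flames near the loading dock."}
    print(operator_alert("cam-07", parse_result(sample)))
```

The explanation is passed through unchanged so that the human operator sees the system's stated reasoning before deciding whether to act.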

Minor Improvements: Increased Usability

  • We changed the default model for our Object and Scene Recognition in the Visual Mining Wizard to the Zero-Shot model, which is our state-of-the-art model and can even be customised using dictionaries.
  • We refactored the Help Center page in our API to provide faster and better guidance, including links to our new Knowledge Hub and Support Form.
  • We improved the UI of the speaker dataset label.

Updates for the Deep Live Hub

In the December 2024 changelog, we introduced significant updates to the Deep Live Hub of our composite AI platform, DeepVA, enhancing usability, flexibility, and security.

Highlights include the integration of multi-streaming, enabling simultaneous live streaming to multiple platforms such as YouTube Live, Facebook Live, and Twitch without requiring external tools like Restream IO. Other additions include reusable subtitle settings, enhanced security through RTMPS, and improvements in stability and error handling.

This week we further improved the Deep Live Hub by adding:

SRT (Secure Reliable Transport) Push & Input

Security is further enhanced through RTMPS and SRT (Secure Reliable Transport). For more details, see our last changelog.

Load Balancing

We also improved the load balancing to deliver good results with lower latency.

Roadmap

Next month we will be releasing new features including deeper text understanding and more functions for the Visual Understanding module, all tied to the ability to combine the modules of our Composite AI platform to achieve higher levels of automation. Stay tuned!

All DeepVA changelog updates are available here: https://docs.deepva.com/changelog/
