PARTNERSHIP / Vidispine

Vidispine Arvato Systems Interface with AI

About Vidispine

Vidispine, part of Arvato Systems, delivers flexible media management solutions for the media and enter­tainment industry. The VidiNet platform enables efficient video content handling—on-premises or in the cloud—helping customers turn content into value.

DeepVA product used

Benefits

Links

Update 09.2025: From ingest to rough cut in seconds.

Together with our partner Vidispine, we’re excited to show how their AI editing assistant helps creators speed up workflows — from ingest to rough cut in no time — so they can focus on what really matters: producing amazing content.

Want to see it live at IBC?
📍 Vidispine – Booth 7.A15
📍 DeepVA – Booth 3.B48D

👉 Take a look at our LinkedIn post with a short trailer

 

DeepVA Face Training Theme for VidiCore

The DeepVA Face Training Theme for VidiCore is an appli­cation designed to showcase the capabil­ities of the DeepVA Face Recog­nition & Analysis Services within VidiNet. It includes features for analyzing video content using DeepVA AI services in VidiNet, managing and organizing the face data model / database with VidiCore, training the data model with known faces before analysis, manually capturing faces from videos, and more.

VidiCore’s powerful indexing services make it easy to locate and display people and faces by combining AI with time-coded metadata. At its core, the Face Training Theme is a light­weight appli­cation that requires minimal config­u­ration. The only prereq­uisite is a live VidiCore instance with S3 storage, a connected VidiCoder, and one or more linked DeepVA services.

Update 2025: New function­ality!

Content Moder­ation for automated content evalu­ation, Visual Under­standing for prompt-based video analysis, and Highlight Clipping for automatic extraction of key scenes.

DeepVA Services in VidiNet

TheFace Training and Analysis Powered by DeepVA” service enables DeepVA’s advanced AI capabil­ities to analyze video content and detect both known and unknown faces. Each detected face is finger­printed and compared, with dupli­cates grouped under a single finger­print. All findings retain the timecode of where the face was detected, allowing for accurate display of search results in the user interface. This service also allows pre-training of the face model before processing large volumes of content.

The “Face and Label Extractor Powered by DeepVA” service performs a detailed frame-by-frame analysis to detect faces in video content and match them with textual infor­mation found in the lower third area of the screen. Unlike the other service, analysis results from this one do not preserve the timecode of face detection. However, the output can be used to pre-train the face data model for use with the “Face Training and Analysis Powered by DeepVA” service.

The output from the “Face and Label Extractor Powered by DeepVA” is essen­tially a set of still images of individuals’ faces, accom­panied by suggested names based on detected lower third text.

Vidispine Media Portal

Vidispine Media­Portal is a web interface for accessing all media and non-media files managed by Vidispine, and provides function­ality for collecting and processing this content within a single search logic and intuitive design. It simplifies your workflows for uploading, house­keeping, editing projects, and distrib­uting content. As a highly integrated part of the Vidispine product portfolio, you can easily customize the appli­cation to your specific needs when it comes to production workflows, media supply chains, and collab­o­rative work via a conve­nient config­u­ration interface.

VidiCore

VidiCore is a Media Management backend service that sees its placement in the base of layer a media supply system. As implied, at the core of your system, its essential role is to handle, and ultimately reduce, the complexity of connecting multiple sources as well as the harvesting and management of media assets and their respective metadata. Coupled together with VidiCoder, VidiCore is able to provide media awareness and proxy gener­ation right out of the box.

Another aspect which grants VidiCore a high degree of flexi­bility is the fact that it is a REST API service. This enables one to easily develop custom solutions on top of it. VidiCore malleable nature allows one to fit it into various appli­cation scenarios: From filling a gap in your existing media workflow, to building sophis­ti­cated media management solutions. VidiCore’s archi­tecture is kept simple. It can easily be deployed on an on-prem, or in cloud environment as well as utilizing vendor specify compo­nents for easy operation and scala­bility.

Benefits for the user

Automatic face finger­printing

Detected faces are automat­i­cally stored in the DeepVA data model for subse­quent analysis

Efficient search function­ality

Easily locate individuals in your media assets

Face management

The software allows you to effec­tively manage and categorize the faces detected by the DeepVA Face Analysis and Training service

DeepVA functions

DeepVA features integrated into Vidispine

Content moder­ation

Automate video content moder­ation with AI. Rate and classify media for violence, nudity, and substance use.

Visual under­standing

Analyze video and image content using customizable AI prompts and perform visual question answering.

Audio transcription

Never miss a word. Our Speech to Text function ensures every­thing spoken is accurately captured in writing.

Face recog­nition

Detect and identifiy the faces of public figures in a variety of categories such as politics, sports, business, and enter­tainment.

 

Landmark recog­nition

Identify all major sights, archi­tec­tural struc­tures, and natural monuments across Europe and North America.

 

Custom AI model training

Train your own AI models for faces, voices, and landmarks — no code needed, no data shared.

Face and speaker dataset creation

Automat­i­cally generate entire datasets using facial images, text infor­mation (e.g., from lower thirds), and audio elimi­nating the need for manual labor.

Face index

With Face Index, each individual in video and image material can be assigned a unique identifier, facil­i­tating their trans­lation into metadata after­wards.

Expert solutions, tailored to your needs

Ready to unlock the potential of AI for your business? Try DeepVA free of charge for 14 days!

latest AI news

Subscribe to our newsletter

Don’t worry, we reserve our newsletter for important news, so we only send a few updates once in a while. No spam!