PARTNERSHIP / Vidispine

About Vidispine

Vidispine, part of Arvato Systems, delivers flexible media management solutions for the media and entertainment industry. The VidiNet platform enables efficient video content handling—on-premises or in the cloud—helping customers turn content into value.

DeepVA product used

Benefits

Update 09.2025: From ingest to rough cut in seconds.

Together with our partner Vidispine, we’re excited to show how their AI editing assistant helps creators speed up workflows — from ingest to rough cut in no time — so they can focus on what really matters: producing amazing content.

Want to see it live at IBC?
📍 Vidispine – Booth 7.A15
📍 DeepVA – Booth 3.B48D

👉 Take a look at our LinkedIn post with a short trailer

DeepVA Face Training Theme for VidiCore

The DeepVA Face Training Theme for VidiCore is an application designed to showcase the capabilities of the DeepVA Face Recognition & Analysis Services within VidiNet. It includes features for analyzing video content using DeepVA AI services in VidiNet, managing and organizing the face data model / database with VidiCore, training the data model with known faces before analysis, manually capturing faces from videos, and more.

VidiCore’s powerful indexing services make it easy to locate and display people and faces by combining AI with time-coded metadata. At its core, the Face Training Theme is a lightweight application that requires minimal configuration. The only prerequisite is a live VidiCore instance with S3 storage, a connected VidiCoder, and one or more linked DeepVA services.

Update 2025: New functionality!

Content Moderation for automated content evaluation, Visual Understanding for prompt-based video analysis, and Highlight Clipping for automatic extraction of key scenes.

DeepVA Services in VidiNet

The “Face Training and Analysis Powered by DeepVA” service enables DeepVA’s advanced AI capabilities to analyze video content and detect both known and unknown faces. Each detected face is fingerprinted and compared, with duplicates grouped under a single fingerprint. All findings retain the timecode of where the face was detected, allowing for accurate display of search results in the user interface. This service also allows pre-training of the face model before processing large volumes of content.

The “Face and Label Extractor Powered by DeepVA” service performs a detailed frame-by-frame analysis to detect faces in video content and match them with textual information found in the lower third area of the screen. Unlike the other service, analysis results from this one do not preserve the timecode of face detection. However, the output can be used to pre-train the face data model for use with the “Face Training and Analysis Powered by DeepVA” service.

The output from the “Face and Label Extractor Powered by DeepVA” is essentially a set of still images of individuals’ faces, accompanied by suggested names based on detected lower third text.

Vidispine Media Portal

Vidispine MediaPortal is a web interface for accessing all media and non-media files managed by Vidispine, and provides functionality for collecting and processing this content within a single search logic and intuitive design. It simplifies your workflows for uploading, housekeeping, editing projects, and distributing content. As a highly integrated part of the Vidispine product portfolio, you can easily customize the application to your specific needs when it comes to production workflows, media supply chains, and collaborative work via a convenient configuration interface.

VidiCore

VidiCore is a Media Management backend service that sees its placement in the base of layer a media supply system. As implied, at the core of your system, its essential role is to handle, and ultimately reduce, the complexity of connecting multiple sources as well as the harvesting and management of media assets and their respective metadata. Coupled together with VidiCoder, VidiCore is able to provide media awareness and proxy generation right out of the box.

Another aspect which grants VidiCore a high degree of flexibility is the fact that it is a REST API service. This enables one to easily develop custom solutions on top of it. VidiCore malleable nature allows one to fit it into various application scenarios: From filling a gap in your existing media workflow, to building sophisticated media management solutions. VidiCore’s architecture is kept simple. It can easily be deployed on an on-prem, or in cloud environment as well as utilizing vendor specify components for easy operation and scalability.

Benefits for the user

Automatic face fingerprinting

Detected faces are automatically stored in the DeepVA data model for subsequent analysis

Efficient search functionality

Easily locate individuals in your media assets

Face management

The software allows you to effectively manage and categorize the faces detected by the DeepVA Face Analysis and Training service

DeepVA functions

DeepVA features integrated into Vidispine

Content moderation

Automate video content moderation with AI. Rate and classify media for violence, nudity, and substance use.

Learn more

Visual understanding

Analyze video and image content using customizable AI prompts and perform visual question answering.

Learn more

Audio transcription

Never miss a word. Our Speech to Text function ensures everything spoken is accurately captured in writing.

Learn more

Face recognition

Detect and identifiy the faces of public figures in a variety of categories such as politics, sports, business, and entertainment.

Learn more

Landmark recognition

Identify all major sights, architectural structures, and natural monuments across Europe and North America.

Learn more

Custom AI model training

Train your own AI models for faces, voices, and landmarks — no code needed, no data shared.

Learn more

Face and speaker dataset creation

Automatically generate entire datasets using facial images, text information (e.g., from lower thirds), and audio eliminating the need for manual labor.

Learn more

Face index

With Face Index, each individual in video and image material can be assigned a unique identifier, facilitating their translation into metadata afterwards.

Expert solutions, tailored to your needs

Ready to unlock the potential of AI for your business? Try DeepVA free of charge for 14 days!

Our AI applications

Deep Media Analyzer

Deep Model Customizer

Deep Collector

Deep Live Hub

Deep Indexer

Deep Explorer

by solution

by customer story

Customer Success Story: Zebra Live meets European Publishing Congress

About Vidispine

DeepVA product used

Benefits

Links

Update 09.2025: From ingest to rough cut in seconds.

DeepVA Face Training Theme for VidiCore

Update 2025: New functionality!

DeepVA Services in VidiNet

Vidispine Media Portal

VidiCore

Benefits for the user

Automatic face fingerprinting

Efficient search functionality

Face management

DeepVA features integrated into Vidispine

Content moderation

Visual understanding

Audio transcription

Face recognition

Landmark recognition

Custom AI model training

Face and speaker dataset creation

Face index

Expert solutions, tailored to your needs

DeepVA

Product

Functions

Resources

Our AI applications

Deep Media Analyzer

Deep Model Customizer

Deep Collector

Deep Live Hub

Deep Indexer

Deep Explorer

by solution

by customer story

About Vidispine

DeepVA product used

Benefits

Links

Update 09.2025: From ingest to rough cut in seconds.

DeepVA Face Training Theme for VidiCore

Update 2025: New function­ality!

DeepVA Services in VidiNet

Vidispine Media Portal

VidiCore

Benefits for the user

Automatic face finger­printing

Efficient search function­ality

Face management

DeepVA features integrated into Vidispine

Content moder­ation

Visual under­standing

Audio transcription

Face recog­nition

Landmark recog­nition

Custom AI model training

Face and speaker dataset creation

Face index

Expert solutions, tailored to your needs

DeepVA

Product

Functions

Resources

Subscribe to our newsletter

Update 2025: New functionality!

Automatic face fingerprinting

Efficient search functionality

Content moderation

Visual understanding

Face recognition

Landmark recognition