Artificial intelligence in the media supply chain


IT Company



Enhancement of a media management system with automatic image and video recognition as well as an integrated feature that allows creating individual AI models in no time.

Press release

With the help of DeepVA, the user can intuitively and easily create and manage training data directly in the media asset management system (MAM).

This greatly increases identification performance in the field of face recognition.

Get in touch

With our expertise, we will help you find a perfect solution for your needs!


Arvato Systems is an internationally active IT specialist and multi-cloud service provider whose mission is to support various companies in their digital transformation. Fast and secure IT systems provide Arvato’s customers easy access to the cloud and ensure optimal connected applications and business processes with the help of IoT, blockchain or artificial intelligence.

With VidiNet, Arvato Systems offers a cloud-based media service platform of the Arvato Systems brand Vidispine, which constitutes a highly efficient basis for the entire content chain with its numerous applications. It offers users a range of services and apps in a pre-integrated environment as SaaS solutions. Thus, the platform not only supports various media workflows, but also allows any scaling options for professional use.

Cooperation with Vidinet: AI meets MAM-System

The cooperation of our AI software with the Arvato Systems Vidispine team turned out to be a complete success. From the beginning, both we and the team behind Arvato Systems were fascinated by the idea of automating media workflows with the most advanced tools from the IT toolbox, namely AI in the field of computer vision. The user of the media asset management system should be able to monitor and control the recognition of content from images and video, the creation of their own AI models and the quality of their training data themselves.

Challenge: Custom AI solutions are expensive and often require expert knowledge

The implementation of prefabricated AI models in the field of computer vision rarely results in the hoped-for added value in practice. While integration into a MAM can be straightforward and fast, recognition performance is severely limited. For example, most organizations additionally need to recognize people in images and videos that are not part of an existing AI model. To achieve this, models must be constantly expanded, updated and adapted. The necessary tools for this must be made transparently available to the user directly in the MAM,

because developing your own solutions to adjust AI models to your own company’s needs is usually expensive and requires expert knowledge in the field of machine learning.

Our experience with generic AI solutions is that the required entities are often not identified in one's own content. Creating training data and managing datasets is complex and time-consuming. DeepVA's integration with our media service has shown us that customizable AI within the already familiar MAM interface can be intuitive and straightforward.

Ralf Jansen Vidispine

Ralf Jansen

Software Architect at Vidispine – An Arvato Systems Brand

Solution: Own AI models directly in the MAM System

DeepVA, in contrast to other business organizations in the field of computer vision, offers the possibility to automatically create own AI models, resulting in more individual content that can be recognized from media assets compared to “pre-trained models”.

Vidispine Arvato Systems Interface with AI

Result: Highly efficient and time-saving facial recognition

The cooperation between DeepVA and VidiNet allows for building individual AI models in the MAM system without any prior technical knowledge. Due to the growing amount of available training data, which is considered the gold of the data-driven age for good reason, the performance of person recognition is increasing rapidly. This results in a more detailed and higher quality tagging of images and videos and as a result in an improved searchability of all content. In addition, face indexing (assigning a unique ID to unrecognized persons) enables further analyses and reverse searches. This optimizes workflows in the MAM and greatly saves time and costs. Employees must no longer spend their time on monotonous and repetitive manual tagging of media content, but instead create an intelligent and constantly improving visual data management system with just a few clicks.

DeepVA, VidiNet, and the accompanying VidiNet Cognitive Services, create a highly integrated MAM ecosystem that allows users to build their own AI models in a simple and intuitive way. With the integration of DeepVA in VidiNet, the required training data can be efficiently created and managed. Sample data can be provided manually or cut and labeled directly from videos using a tool. Face Dataset Creation even allows to automatically generate appropriate training data, by extracting and saving sample images from interview scenes. In addition to the management of training data, the data can also be automatically checked for usability for custom AI models.

0 %


100 %


0 %


Take a look at our other success stories

Partnerschaft mit Heilbronn Stadtarchiv


DeepVA helps the Heilbronn City Archive to automatically index and digitize a large amount of images.

Read More »
DeepVA Landmark Recognition


rbb’s collaboration with DeepVA transformed regional broadcasting and archiving through efficient AI utilization.

Read More »

With our expertise, we will help you find a perfect solution for your needs!

We Are Supported By

latest AI news

Subscribe to our newsletter

Don’t worry, we reserve our newsletter for important news, so we only send a few updates once in a while. No spam!

🎉 Introducing Deep Live Hub: Your All-in-One Live Streaming Solution!  
This is default text for notification bar