CUSTOMER SUCCESS STORY / Bayerischer Rundfunk

Creation and management of training data

br_bayerischer rundfunk

INDUSTRY

Public Broad­casting Service

PRODUCT

BENEFITS

Struc­turation and index­ation of archival collec­tions and, as a result, optimization and accel­er­ation of workflows.

Creating and managing training data for face recog­nition using AI requires a lot of time and resources.

DeepVA was able to automate this process and save 86% of costs while achieving 99.32% accuracy.

Since 1949, Bayerischer Rundfunk stands for respon­sible journalism providing quali­tative content in the areas of enter­tainment, infor­mation, culture and education. Eight million people use the range of content of the public broad­caster daily either online, via TV or apps. BR has strong regional roots in Bavaria, and thus also offers local content that inspires and fasci­nates people. Bayerischer Rundfunk is the most important news provider in Bavaria and reports about all relevant events, current affairs and recent devel­op­ments in programs like Rundschau or Abend­schau.

Challenge: Manual training-data management is too expensive and takes too much time

The increasing demand and output of video material in the last few years made it necessary to use machine learning and deep learning algorithms to increase the efficiency of media management and production. Artificial intel­li­gence is used, among other things, to enrich image and video material with metadata in the area of face recog­nition. The process of watching and manually labeling images or videos is no longer necessary. Such AI models can only perform well if associated training datasets are kept up-to-date and maintained frequently. However, pretrained AI models and exter­nally supplied training material of typical providers of recog­nition services do not meet the require­ments of a TV broad­caster like the BR, who predom­i­nantly includes local celebrities, historical material or German language in its supplied content. Gener­ating adequate training data in-house requires human resources, is time-consuming and cost intensive.

For BR, the question arose if its own archived material of news broad­casts could be used to automate the creation of training data for facial recog­nition and thus creating a customized and constantly evolving training data set without exorbi­tantly high staff costs.

Solution: Face Dataset Creation by DeepVA
- an automated training data creation.

To meet this challenge, BR approached the Freiburg-based AI startup DeepVA, and together they were able to develop an automated solution for the creation of training data in the area of face recog­nition. With the help of the so-called Face Dataset Creation, video material already published by BR is analyzed with regard to name inser­tions. Names of person­al­ities are usually displayed in news and interview scenes. As a rule, the angle, quality and other condi­tions are suffi­cient for sample images that can be used as training data for the AI.

Ultimately, the infor­mation from the name overlay is extracted along with the associated face and stored in a dataset, which can be used to train an AI model. This way, large training datasets can be created without manual effort, resulting in AI models that can better meet company-specific require­ments and outperform out-of-the-box recog­nition services.

Result: Signif­icant time savings with excep­tional accuracy

As part of the cooper­ation between BR and DeepVA, 641 videos of news broad­casts were analyzed using Face Dataset Creation for test purposes in addition to already ongoing productive operation. An accuracy of 96.27% was achieved when reading out the name inser­tions. The time required for this automated training data creation was compared to the manual approach. If the same amount of data were provided manually (finding video segments where names are inserted; manually storing names; extracting the matching faces from the respective images; and storing all this infor­mation), it would take an employee 17 weeks based on a 40-hour work week. Automating this process with DeepVA resulted in a time investment of less than 4 days for overall 300 hours of video footage. This repre­sents a time saving of 86%, which could even be increased with a more powerful server structure. The AI models that were trained based on this data achieved an accuracy of 99.32% in the recog­nition of person­al­ities. The collab­o­ration between Bayerischer Rundfunk and DeepVA shows that it is possible to automate the creation of company-specific training data for face recog­nition and delivers amazing results in terms of time and cost savings. With the help of this technology, media companies can create extensive data sets and benefit from individual AI models to better structure their exclusive media archive and make content more manageable.

0 %

ACCURACY

100 %

NEED OF AI EXPERTS

0 %

FASTER LABELLING

Take a look at our other success stories

Vidispine

With the help of DeepVA, the user can intuitively and easily create and manage training data directly in the media asset management system (MAM).

Read More »

Heilbronn

DeepVA helps the Heilbronn City Archive to automat­i­cally index and digitize a large amount of images.

Read More »

Expert solutions, tailored to your needs

Ready to unlock the potential of AI for your business? Try DeepVA free of charge for 14 days!

We Are Supported By

latest AI news

Subscribe to our newsletter

Don’t worry, we reserve our newsletter for important news, so we only send a few updates once in a while. No spam!