use case: Build speak dataset automatically
There is a great deal of visual information in content that can’t be systematically mapped or stored traceably, as well as information about speakers at the audio level that is lost and doesn’t appear in the metadata. How can DeepVA support me in extracting speakers from my material in order to better structure my archive and make it searchable?
The DeepVA Deep Colletor can be used to automatically create speaker datasets. This is done by reading out the lower thirds that contain the speaker’s name. If there is material that shows Barack Obama speaking, for example, and his name appears underneath, this information is automatically linked and transferred into a separate speaker dataset. The system can be used on-premise or in the cloud. If it is to be part of a workflow, integration is required. Via our RESTful API, it can be easily integrated in any existing system or workflow. Data protection requirements usually play a major role in this decision and should be considered.
What results can be obtained?
faster data acquisition
Automatically build a speaker dataset
Using Face Index, each person in video and image material can be assigned a number, allowing them to be translated into the metadata afterwards.
Enter your email below and we will get back to you as soon as possible.
Related use cases
don't miss the latest news!
Don’t worry, we reserve our newsletter for important news, so we only send a few updates once in a while. No spam!