
Automatically build a speaker dataset
How can I build up a dataset for my speaker recognition?
use case: Build speak dataset automatically
the challenge
There is a great deal of visual information in content that can’t be systematically mapped or stored traceably, as well as information about speakers at the audio level that is lost and doesn’t appear in the metadata. How can DeepVA support me in extracting speakers from my material in order to better structure my archive and make it searchable?
the solution
The DeepVA Deep Colletor can be used to automatically create speaker datasets. This is done by reading out the lower thirds that contain the speaker’s name. If there is material that shows Barack Obama speaking, for example, and his name appears underneath, this information is automatically linked and transferred into a separate speaker dataset. The system can be used on-premise or in the cloud. If it is to be part of a workflow, integration is required. Via our RESTful API, it can be easily integrated in any existing system or workflow. Data protection requirements usually play a major role in this decision and should be considered.
What results can be obtained?
The AI recognizes the display of a lower third
Training data from the audio track is stored together with the speaker's name and any additional information in a database
Automated creation of a unique speaker database
Constant expansion of the training data by automatic data addition
faster data acquisition
COST REDUCTION
faster labelling
Automatically build a speaker dataset
Using Face Index, each person in video and image material can be assigned a number, allowing them to be translated into the metadata afterwards.
Contact us
Enter your email below and we will get back to you as soon as possible.
Related use cases
How can I build up a dataset for my speaker recognition?
How can I efficiently find what I’m really looking for in my visual media material?
How can I recognize people and objects more quickly and establish references between them?
The project has indirectly received funding from the European Commission’s Horizon 2020 Framework Programme through the STADIEM project (Grant Agreement 957321)
don't miss the latest news!
Don’t worry, we reserve our newsletter for important news, so we only send a few updates once in a while. No spam!
Cookie | Duration | Description |
---|---|---|
cookielawinfo-checbox-analytics | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics". |
cookielawinfo-checbox-functional | 11 months | The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional". |
cookielawinfo-checbox-others | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other. |
cookielawinfo-checkbox-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
cookielawinfo-checkbox-performance | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance". |
viewed_cookie_policy | 11 months | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |