AI model training that fits your content

Training AI models doesn’t have to be a complex process locked behind layers of code and data science jargon. With DeepVA, you can train your own AI model — faces, voices, landmarks — using the media you already have.

custom face training

Train faces you know. Recognize the people who matter.

With DeepVA’s Face Training model, identi­fying familiar faces across your video archives becomes effortless. Just upload a few clips or images — the system handles the rest. It crops, detects, and starts training your custom model using DeepVA’s Deep Model Customizer.

  • Custom facial recog­nition tailored to your needs

    Train your own facial recog­nition models using company-specific data — from internal stake­holders to regular guests or local public figures.

custom speaker training

Give voice to your content: Speaker Training

Need to attribute speech to the right person? The Speaker Training model has your back. Upload labeled audio files — short clips, inter­views, even podcast excerpts — and DeepVA builds speaker recog­nition models that actually know who’s talking.
  • Custom speaker identi­fi­cation models for your organi­zation

    Train AI models to recognize specific voices — such as presenters, execu­tives, or recurring guests — based on your own audio or video content.

custom landmark training

Places have voices too: Landmark Training

Some stories are tied to where they happened. Landmark Training helps you train AI models that spot specific locations — like city halls, cultural landmarks, or branded spaces — straight from your own footage or imagery.

  • Custom recog­nition of locations, buildings, and environment

    Train AI models to recognize company-specific, regional, or histor­i­cally relevant landmarks based on your own image or video data.

Dataset evalu­ation

AI model evalu­ation

With DeepVA you can automat­i­cally evaluate the quality of your training data for face and landmark recog­nition. The system checks factors like image resolution, relative size, and positioning of relevant features—providing clear feedback to help you improve your dataset before training. This ensures more accurate and reliable AI model results, without requiring any machine learning expertise.

Custom AI model training module is part of our Deep Media Customizer appli­cation. Check it out now: 

Deep Model Customizer

Create custom AI models with ease

The benefits of DeepVA custom AI model training

Adapt the AI to your needs

Tailor recog­nition models for faces, voices, and landmarks to reflect company-specific, regional, or contextual needs.

No-code training interface for fast deployment

Quickly build custom AI models through a no-code interface – even with very limited training data, thanks to few-shot learning.

Seamless integration with media workflows

Trained models can be deployed immedi­ately across the DeepVA ecosystem or integrated with third-party systems via API for consistent cross-platform recog­nition.

Real-world success stories

DeepVA Landmark Recognition
Broad­caster

rbb

rbb’s collab­o­ration with DeepVA trans­formed regional broad­casting and archiving through efficient AI utilization.

Read More »
City archive

Heilbronn

DeepVA helps the Heilbronn City Archive to automat­i­cally index and digitize a large amount of images.

Read More »

frequently asked questions

Have a question? We’ve got answers

What is AI Model Training?

AI model training is the process of teaching an artificial intel­li­gence system how to recognize specific patterns — like faces, voices, or landmarks — using examples that you provide. You upload your own media (images, video clips, audio), label what’s relevant, and the system learns to identify those patterns in future content.

But here’s something important: when you train an AI model with DeepVA, your data stays yours. We don’t store, reuse, or learn from your training data — ever. The models you train are isolated to your organi­zation, and all data handling is fully GDPR-compliant. No hidden training on your content, no surprise usage, and no shared datasets behind the scenes.

It’s not just smart — it’s secure. You’re in full control of your training process, your media, and your results. That means you get a custom AI model that truly fits your world — without compro­mising privacy or ownership.

How many identities can I train in a single model?

You can train up to several thousand individual identities, depending on the dataset and desired accuracy. The model scales flexibly.

How much data is required to train a person’s face?

As little as 5–10 high-quality images per person can yield strong results. More data improves accuracy and confi­dence scores.

What kinds of landmarks can be trained and how much training data is needed?

Anything visually consistent and identi­fiable — including buildings, monuments, landscapes, and even brand-specific environ­ments or interior spaces.

Typically, 20–50 high-quality images per landmark provide a solid training base. More images improve the precision and confi­dence score of detection.

How many speakers can I train in a single model?

The system supports hundreds of trained speaker profiles. Larger-scale deploy­ments can be customized based on your infra­structure and perfor­mance needs.

How much audio is needed to train a speaker? How many speakers can I train in a single model?

Around 2–5 minutes of clear speech per person is suffi­cient for reliable training. More data improves confi­dence and accuracy.

Can I combine Speaker Training with transcription tools?

Yes. The model integrates seamlessly with transcription workflows, attributing each segment to the correct speaker for cleaner, more struc­tured output — you can read more about audio transcription here.

Can I update or retrain an existing model?

Absolutely. You can add new faces, remove outdated ones, or retrain models as your needs evolve — without starting from scratch.

Is your service GDPR compliant?

Yes, DeepVA is fully GDPR compliant. We take data protection and privacy seriously and ensure that all personal data is processed in accor­dance with GDPR regula­tions.

How is my data handled? Does the AI learn from my data?

You have full control over your data on our AI platform, ensuring it remains secure and compliant. By default, we do not use your data to train our models, keeping it propri­etary. However, you have the option to train models using your data, and in that case, it will remain exclusive to your organi­zation.

What type of data do you store?

By default, we do not process your data beyond what is required to provide our services. If additional processing is necessary, it will only occur as outlined in your instruc­tions or where legally required. For example, data may be trans­ferred or processed as needed to fulfill service require­ments, always in alignment with our agree­ments.

To learn more about how we process data and the safeguards in place, please refer to our Data Processing Agreement.

latest AI news

Subscribe to our newsletter

Don’t worry, we reserve our newsletter for important news, so we only send a few updates once in a while. No spam!