AI model training that fits your content

Training AI models doesn’t have to be a complex process locked behind layers of code and data science jargon. With DeepVA, you can train your own AI model — faces, voices, landmarks — using the media you already have.

custom face training

Train faces you know. Recognize the people who matter.

With DeepVA’s Face Training model, identifying familiar faces across your video archives becomes effortless. Just upload a few clips or images and the system handles the rest: it detects and crops the faces, then starts training your custom model using DeepVA’s Deep Model Customizer.

Custom facial recognition tailored to your needs

Train your own facial recognition models using company-specific data — from internal stakeholders to regular guests or local public figures.

custom speaker training

Give voice to your content: Speaker Training

Need to attribute speech to the right person? The Speaker Training model has your back. Upload labeled audio files — short clips, interviews, even podcast excerpts — and DeepVA builds speaker recognition models that actually know who’s talking.

Custom speaker identification models for your organization

Train AI models to recognize specific voices — such as presenters, executives, or recurring guests — based on your own audio or video content.

custom landmark training

Places have voices too: Landmark Training

Some stories are tied to where they happened. Landmark Training helps you train AI models that spot specific locations — like city halls, cultural landmarks, or branded spaces — straight from your own footage or imagery.

Custom recognition of locations, buildings, and environments

Train AI models to recognize company-specific, regional, or historically relevant landmarks based on your own image or video data.

Dataset evaluation

AI model evaluation

With DeepVA, you can automatically evaluate the quality of your training data for face and landmark recognition. The system checks factors such as image resolution and the relative size and positioning of relevant features, and gives clear feedback to help you improve your dataset before training. This leads to more accurate and reliable AI model results, without requiring any machine learning expertise.

The custom AI model training module is part of our Deep Model Customizer application. Check it out now:

Deep Model Customizer

Create custom AI models with ease

The benefits of DeepVA custom AI model training

Adapt the AI to your needs

Tailor recognition models for faces, voices, and landmarks to reflect company-specific, regional, or contextual needs.

No-code training interface for fast deployment

Quickly build custom AI models through a no-code interface – even with very limited training data, thanks to few-shot learning.

Seamless integration with media workflows

Trained models can be deployed immediately across the DeepVA ecosystem or integrated with third-party systems via API for consistent cross-platform recognition.

Practical Applications

Custom AI model training

Track specific individuals in video archives, identify your speakers in audio, and detect specific locations or objects. This makes custom training ideal for media companies, security teams, and content platforms that need precise, automated analysis.

Real-world success stories

Broadcaster

rbb

rbb’s collaboration with DeepVA transformed regional broadcasting and archiving through efficient AI utilization.

City archive

Heilbronn

DeepVA helps the Heilbronn City Archive automatically index and digitize a large number of images.

Broadcaster

Bayerischer Rundfunk

Creating and managing training data for face recognition using AI requires a lot of time and resources.

Have a question? We’ve got answers

What is AI Model Training?

AI model training is the process of teaching an artificial intelligence system how to recognize specific patterns — like faces, voices, or landmarks — using examples that you provide. You upload your own media (images, video clips, audio), label what’s relevant, and the system learns to identify those patterns in future content.

But here’s something important: when you train an AI model with DeepVA, your data stays yours. We don’t store, reuse, or learn from your training data — ever. The models you train are isolated to your organization, and all data handling is fully GDPR-compliant. No hidden training on your content, no surprise usage, and no shared datasets behind the scenes.

It’s not just smart — it’s secure. You’re in full control of your training process, your media, and your results. That means you get a custom AI model that truly fits your world — without compromising privacy or ownership.

How many identities can I train in a single model?

You can train up to several thousand individual identities, depending on the dataset and desired accuracy. The model scales flexibly.

How much data is required to train a person’s face?

As little as 5–10 high-quality images per person can yield strong results. More data improves accuracy and confidence scores.

What kinds of landmarks can be trained and how much training data is needed?

Anything visually consistent and identifiable — including buildings, monuments, landscapes, and even brand-specific environments or interior spaces.

Typically, 20–50 high-quality images per landmark provide a solid training base. More images improve the precision and confidence score of detection.

How many speakers can I train in a single model?

The system supports hundreds of trained speaker profiles. Larger-scale deployments can be customized based on your infrastructure and performance needs.

How much audio is needed to train a speaker?

Around 2–5 minutes of clear speech per person is sufficient for reliable training. More data improves confidence and accuracy.
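If you want to check a dataset against these rules of thumb before uploading, a small script is enough. The sketch below uses the lower bounds quoted in the answers above (5 images per face, 20 images per landmark, 2 minutes of speech per speaker). The `MINIMUMS` table and the `find_underfilled_classes` function are hypothetical names for illustration, and nothing here reflects how DeepVA itself validates data.

```python
# Lower bounds taken from the FAQ figures; treating them as hard minimums
# is an assumption made for this example.
MINIMUMS = {
    "face": 5,       # images per person
    "landmark": 20,  # images per landmark
    "speaker": 120,  # seconds of clear speech per person (2 minutes)
}


def find_underfilled_classes(kind: str, amounts: dict) -> dict:
    """Map each label that falls short of the minimum to the amount still missing.

    `amounts` maps a label (person, landmark, or speaker name) to the number
    of images, or to seconds of audio when `kind` is "speaker".
    """
    minimum = MINIMUMS[kind]
    return {
        label: minimum - amount
        for label, amount in amounts.items()
        if amount < minimum
    }


if __name__ == "__main__":
    faces = {"Alice": 8, "Bob": 3}
    print(find_underfilled_classes("face", faces))  # Bob is 2 images short
```

A result of `{}` means every label meets the lower bound; anything else tells you exactly which labels need more material and how much.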

Can I combine Speaker Training with transcription tools?

Yes. The model integrates seamlessly with transcription workflows, attributing each segment to the correct speaker for cleaner, more structured output — you can read more about audio transcription here.
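One common way to combine the two outputs is to match each transcript segment to the speaker turn it overlaps most in time. The sketch below illustrates that general technique only; the `Segment` type and `attribute_speakers` function are assumptions for this example and do not describe DeepVA's API or output format.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """A time span in seconds with a label: transcript text or a speaker name."""
    start: float
    end: float
    label: str


def overlap(a: Segment, b: Segment) -> float:
    """Length in seconds of the time shared by two segments (0.0 if disjoint)."""
    return max(0.0, min(a.end, b.end) - max(a.start, b.start))


def attribute_speakers(transcript, speaker_turns, unknown="unknown"):
    """Pair each transcript segment with the speaker whose turn overlaps it most."""
    result = []
    for line in transcript:
        best = max(speaker_turns, key=lambda turn: overlap(line, turn), default=None)
        if best is None or overlap(line, best) == 0.0:
            speaker = unknown  # no recognized speaker during this segment
        else:
            speaker = best.label
        result.append((speaker, line.label))
    return result


if __name__ == "__main__":
    transcript = [
        Segment(0.0, 4.0, "Welcome back to the show."),
        Segment(4.0, 9.0, "Thanks for having me."),
        Segment(20.0, 22.0, "Goodbye."),
    ]
    turns = [Segment(0.0, 4.5, "Host"), Segment(4.5, 10.0, "Guest")]
    for speaker, text in attribute_speakers(transcript, turns):
        print(f"{speaker}: {text}")
```

Segments with no overlapping speaker turn are labeled `unknown` rather than guessed, which keeps the structured output honest about gaps in recognition.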

Can I update or retrain an existing model?

Absolutely. You can add new faces, remove outdated ones, or retrain models as your needs evolve — without starting from scratch.

Is DeepVA GDPR compliant?

Yes, DeepVA is fully GDPR compliant. We take data protection and privacy seriously and ensure that all personal data is processed in accordance with GDPR regulations.

How is my data handled? Does the AI learn from my data?

You have full control over your data on our AI platform, ensuring it remains secure and compliant. By default, we do not use your data to train our models, keeping it proprietary. However, you have the option to train models using your data, and in that case, it will remain exclusive to your organization.

What type of data do you store?

By default, we do not process your data beyond what is required to provide our services. If additional processing is necessary, it will only occur as outlined in your instructions or where legally required. For example, data may be transferred or processed as needed to fulfill service requirements, always in alignment with our agreements.

To learn more about how we process data and the safeguards in place, please refer to our Data Processing Agreement.

Have more questions? Contact us
