Machine learning training data
Large and specific groups of consumers for Voice and Image
Our clients struggle with the following issues
General data sets are not specific enough for your application. You do have 500 hours of speech but not on the topic you are looking for.
You have the right customer calls from your call center, but there are privacy sensitive elements in these calls that makes it impossible to use as training data.
Collecting real life training data is expensive. For example, installers who must take photos of 1000 meter boxes.
You have training data in English and German, but not in other important languages.
What is CG Research’ role in the project?
Our panel consists of 25,000 Dutch people who are happy to participate in market research. They also want to do other types of “assignments” for a small fee. These panel members participated in 2019 for several large projects to collect training data for machine learning.
Our customers often use the Netherlands as a pilot country and then roll out the data collection in other countries. CG plays a coordinating role and shares its best practices with our international partners. We already collected training data in Brazil, Mexico, Spain and the UK. We can do so in other European countries with our partners for qualitative research and we are also active in China and India.
We collect training data to optimize:
Voice technology where users can interact with your device or software with their voice.
Image Recognition uses artificial intelligence technology to automatically identify objects, people, places and actions in images.
Sentiment analysis is the automated process of understanding an opinion on a certain topic from written or spoken language.
Data collection in the following countries
- United Kingdom
- United States
Let’s get in touch
CG Research is the ideal partner. You determine which part you want to outsource or do yourself.