BLOG
Access the world of language data with TAUS
Sharing insights, ideas and knowledge
Language Data
How Much Training Data Do I Need?
by
Husna Sayedi
4 Oct 2021
The amount of training data you need depends on many variables - the model you use, the task you ..
Language Data
Why Do Data Cleaning and Anonymization Matter?
by
Husna Sayedi
4 Oct 2021
Data cleaning is an essential step in machine learning and takes place before the model training ..
Language Data
Training Data Sourcing Methods
by
Husna Sayedi
4 Oct 2021
Training data can be sourced from many different places, depending on your machine learning ..
Language Data
Web Scraping for Parallel Corpora Creation
by
Lisa Vasileva
1 Oct 2021
Acquiring high-quality parallel corpora is essential for training good-performing MT engines. There ..
Food for Thought
Reconfiguring the Translation Ecosystem in the 2020s
by
Jaap van der Meer
1 Oct 2021
In our article Translation Economics of the 2020s in Multilingual Magazine I raised the question ..
Success Stories
Breaking the Publishing Ground: From Dictionaries to Linguistic Data
by
Şölen Aslan
14 Sep 2021
TAUS Data Marketplace has brought new opportunities to everyone, from individual linguists and LSPs ..
Language Data
Intent Recognition in NLP
by
Husna Sayedi
7 Sep 2021
What is Intent Recognition and Why is it Important? As our society continues to rely on ..
Language Data
Domain Classification with Natural Language Processing
by
András Aponyi
19 Aug 2021
There is a vast collection of textual data on the internet and in various organizational databases ..
Food for Thought
A Journey into the Future of the Translation Industry
by
Jaap van der Meer
3 Aug 2021
The long-expected technical revolution is here. Automatic translation is no longer just a freebie ..
Language Data
What is Sentiment Analysis? Types and Use Cases
by
Husna Sayedi
7 Jun 2021
Sentiment analysis is a subfield of Natural Language Processing (NLP) where the general sentiment ..
Language Data
When to Community-Source Your Training Data for ML
by
Milica Panić
3 Jun 2021
The amount of content that is being produced worldwide and needs translation has been surging for ..
Language Data
Data Preparation for ML: A Brief Guide
by
Husna Sayedi
1 Jun 2021
Perhaps the most pivotal step in your machine learning application is the data preparation phase. ..