Home
Insights
Blog
Language Data
Language Data
Language Data
TAUS Data Sale to Boost Multilingual LLMs
by
Anne-Maj van der Meer
11 Mar 2024
TAUS offers its data collection of close to 7.4 billion words for sale this spring at discounts of ..
Language Data
Transforming Translations: The Crucial Role of Language Data in the Age of Large Language Models and Generative AI
by
Anne-Maj van der Meer
9 Nov 2023
In the ever-evolving landscape of the global translation and localization industries, the advent of ..
Language Data
Domain Adaptation: Types and Methods
by
Anne-Maj van der Meer
19 Dec 2022
There is still a lack of the amounts of labeled data required to feed data-hungry neural models, ..
Language Data
Ten-Step Guide to Data Cleaning
by
Anne-Maj van der Meer
19 Dec 2022
While there are many ways to get clean data, at TAUS we distinguish ten different steps, five of ..
Language Data
A Brief Introduction to Text Summarization
by
Anne-Maj van der Meer
19 Dec 2022
Text summarization is the process of taking pieces from a longer text to put together a (shorter) ..
Language Data
Synthetic Data Generation for Neural Machine Translation
by
Lahorka Nikolovski
7 Oct 2022
In recent years, NMT systems are getting better and better, some even claiming human parity. If ..
Language Data
What is Speech Recognition and how to do it?
by
Pamela Álvarez Ferreira
22 Jun 2022
Speech recognition is a complex mélange of linguistics, mathematics and statistics. Also known as ..
Language Data
Types of Audio Transcription and when to use them
by
Pamela Álvarez Ferreira
20 May 2022
Audio transcription is a service that has been seeing growing demand in recent years to help ..
Language Data
Natural Language Technologies (NLT) to Drive the Next Generation of AI Solutions
by
Şölen Aslan
3 Mar 2022
The AI scene of the 2010s was shaped by breakthroughs in vision-enabled technologies, from advanced ..
Language Data
NLP-driven Word Clouds in Data Marketplace
by
András Aponyi
3 Jan 2022
Bilingual, NLP-driven word clouds are now available in TAUS Data Marketplace. In this article, we ..
Language Data
Data-Enhanced Machine Translation
by
Jaap van der Meer
2 Dec 2021
This is the third article in my series on Translation Economics of the 2020s. In the first article ..
Language Data
Data and AI Trends in 2022
by
Şölen Aslan
1 Dec 2021
Technologies such as Natural Language Processing (NLP), deep learning and computer vision have been ..
1
2
3
4
5
...
6
❯
Home
Blog