Language Data

by Anne-Maj van der Meer 11 Mar 2024
TAUS offers its data collection of close to 7.4 billion words for sale this spring at discounts of ..
by Anne-Maj van der Meer 9 Nov 2023
In the ever-evolving landscape of the global translation and localization industries, the advent of ..
by Anne-Maj van der Meer 19 Dec 2022
There is still a lack of the amounts of labeled data required to feed data-hungry neural models, ..
by Anne-Maj van der Meer 19 Dec 2022
While there are many ways to get clean data, at TAUS we distinguish ten different steps, five of ..
by Anne-Maj van der Meer 19 Dec 2022
Text summarization is the process of taking pieces from a longer text to put together a (shorter) ..
by Lahorka Nikolovski 7 Oct 2022
In recent years, NMT systems are getting better and better, some even claiming human parity. If ..
by Pamela Álvarez Ferreira 22 Jun 2022
Speech recognition is a complex mélange of linguistics, mathematics and statistics. Also known as ..
by Pamela Álvarez Ferreira 20 May 2022
Audio transcription is a service that has been seeing growing demand in recent years to help ..
by Şölen Aslan 3 Mar 2022
The AI scene of the 2010s was shaped by breakthroughs in vision-enabled technologies, from advanced ..
by András Aponyi 3 Jan 2022
Bilingual, NLP-driven word clouds are now available in TAUS Data Marketplace. In this article, we ..
by Jaap van der Meer 2 Dec 2021
This is the third article in my series on Translation Economics of the 2020s. In the first article ..
by Şölen Aslan 1 Dec 2021
Technologies such as Natural Language Processing (NLP), deep learning and computer vision have been ..