
EUSKORPUS

The great digital library of the Basque language.
We are developing the digital library that machines need to understand and speak Basque. We generate massive corpora and open source models to ensure a functional and competitive Basque language.
DESCRIPTION

Digital corpus.
A digital corpus is like an infinite library, but for training artificial intelligence. It includes everything from everyday conversations to specialized texts.
The project contributes to the preservation and maintenance of the Basque language in digital environments.

Why is it vital?
Because without data, there is no AI. And without AI, Basque is left off the digital map. Euskorpus is the foundation that will enable the development of voice assistants, machine translators, chatbots, and a thousand other applications in Basque, while promoting a positive impact on both the industrial fabric and the social sphere, and aligning with the European framework for digital linguistic resources.
THE 3 PHASES

A clear plan.
A guaranteed impact.
We collect and label rich and diverse content.

We develop open-source AI models.

We put them at the service of industry and society.

