Real-Time Identification of Named Entities in Online Basque-language Media

##plugins.themes.bootstrap3.article.main##

##plugins.themes.bootstrap3.article.sidebar##

Published 15-01-2021
Joseba Fernández de Landa
Rodrigo Agerri

Abstract

Names referring to people, institutions, or places may be defined as named entities. Extracting named entities from news texts can help to identify the most commented topics talked about in news media. The main objective of this work is to identify in real-time those named entities that are most commented upon on Basque-language online media. In order to do so, we develop a system to automatically collect and annotate the named entities appearing in news written in Basque language. The annotation of named entities is performed using state-of-the-art deep learning models. Finally, the most frequent identified entities are published weekly in a Wikipedia page to display which entities do not currently have an article in the Basque Wikipedia.
Abstract 336 | PDF (Euskara) Downloads 250

##plugins.themes.bootstrap3.article.details##

Keywords

Basque-language Media, Named Entity Recognition, Natural Language Processing

Section
Ale Arrunta