TRENDMINER: Large-scale Cross-lingual Trend Mining of Real-time media streams

The recent massive growth in online media and the rise of user-authored content (e.g weblogs, Twitter, Facebook) has lead to challenges of how to access and interpret these strongly multilingual data, in a timely, efficient, and affordable manner. Scientifically, streaming online media pose new challenges, due to their shorter, noisier, and more colloquial nature. Moreover, they form a temporal stream strongly grounded in events and context. Consequently, existing language technologies fall short onaccuracy, scalability and portability. The goal of this project is to deliver.

MULTIMEDICA: Multilingual Information Extraction in Health domain and application to scientific and informative documents

The aim of this project is to define and develop information extraction and retrieval techniques based on texts from the medical domain. This will be carried out following two basic tasks: firstly, processing scientific documents in English about pharmacology, and secondly, processing informative texts about health topics in other languages such as Spanish and Arabic.

Improving the access and visibility of the multilingual information in the Region of Madrid

MAVIR Consortium is a research network co-funded by the Regional Government of Madrid under the IV Plan Regional de Investigación Científica e Innovación Tecnológica (IV PRICIT) integrating a multi-disciplinar team made of scientists, engineers, linguists and documentalists working together on two main areas:Human Language Technologies and Scientific Communication via WWW. The thematic network suggested includes 25 doctors organised in 6 research groups (UNED, UAM, UC3M, UEM, UPM and CINDOC) of the CM that from a multi-discipline perspective complement in various dimensions: academic vs.