much.more about partners contacts home  
publications  

WP9: Performance Evaluatione of the Art

 

EIT worked on monolingual experiments in German and English as baselines for the cross-language experiments. Two sets of relevance assessments were produced by separate groups of medical experts (at ZInfo and at CMU). In order to check the influence of the relevance assessments on the retrieval results the monolingual German evaluation runs were repeated for ZInfo and CMU relevance assessments and checked against each other. For most of the cross-language evaluation runs German queries were used to retrieve English documents. The results of these experimenst were compared to the monolingual runs but also between different approaches. All CLIR experiments were performed with the ZInfo relevance assessments. Experiments included: CLIR via Vocabulary Overlap; CLIR via Machine Translation of the Queries; CLIR via Semantic Codes; CLIR via a Bilingual Similarity Thesaurus; CLIR with English queries and German documents; Different weighting schemes.

XRCE investigated whether an optimal combination of different resources for query translation could lead to improved results in the context of cross-language information retrieval. Data for estimating mixture weights was automatically derived by building pseudo-translation pairs in which the source words are extracted from the set of queries used in the training phase, and their corresponding target words are taken to be those occurring in all the documents judged to be relevant to the query. Experiments in this context showed that the combination thus obtained was always better than a simple merge of the resources, but we nevertheless failed to report significant improvements on the retrieval results.


 
last modified, july 2003
more   close