Advances in Natural Language Processing: 4th International by Arturo Montejo Ráez, Luís Alfonso Ureña López (auth.), José

By Arturo Montejo Ráez, Luís Alfonso Ureña López (auth.), José Luis Vicedo, Patricio Martínez-Barco, Rafael Muñoz, Maximiliano Saiz Noeda (eds.)

This book constitutes the refereed proceedings of the 4th International Conference, EsTAL 2004, held in Alicante, Spain, in October 2004.

The 42 revised full papers presented were carefully reviewed and selected from 72 submissions. The papers address current issues in computational linguistics and in monolingual and multilingual intelligent language processing and applications, in particular written language analysis and generation; pragmatics, discourse, semantics, syntax, and morphology; lexical resources; word sense disambiguation; linguistic, mathematical, and psychological models of language; knowledge acquisition and representation; corpus-based and statistical language modeling; machine translation and translation tools; computational lexicography; information retrieval; extraction and question answering; automatic summarization; document categorization; natural language interfaces; and dialogue systems and evaluation of systems.

The technique of choosing all the synsets containing a given word (with or without closed-class words) was clearly poorer than all the others.
– The metric obtained by looking for coincidences of the dependences between words in the candidate answer and in the references was also inferior to the other configurations. This may be due to the fact that the dependences are only collected for some words because the parses are incomplete, and much information is lost using this representation.

[Table 3, partially recovered: 3452; Deps.; 1691. Source page header: E. Alfonseca and D. Pérez, p. 32.]

An English student would see the question translated automatically into English and would write the answer; the system would then automatically translate it into Spanish and score it against the teacher’s references. As can be seen from the results, the loss of accuracy is small; for some of the questions, the correlation in the best configuration even increases. Again, the removal of closed-class words seems to give better results, and there are two cases in which Word-Sense Disambiguation is useful.
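The comparison above depends on a correlation between the system's scores and the teacher's scores. The excerpt does not say which coefficient was used, so the sketch below assumes Pearson's correlation. The function name `pearson` and the sample values are hypothetical, invented only for illustration; they are not results from the paper.

```python
import math
from typing import Sequence


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation between two equally long score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two score lists of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of co-deviations and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / math.sqrt(var_x * var_y)


# Made-up scores for illustration only (not data from the paper).
system_scores = [0.2, 0.5, 0.7, 0.9]    # hypothetical automatic scores
teacher_scores = [1.0, 2.0, 3.0, 4.0]   # hypothetical teacher marks
correlation = pearson(system_scores, teacher_scores)
```

Running the same function on scores from the monolingual and the translated set-up would give the two numbers being compared in the paragraph above.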

4 Artificial Neural Networks

Using a neural network simulator developed at our lab, and the same feature vectors used in the previous experiment, we trained a binary classifier, which computes the probability of misalignment for each segment. As we did in the training of the regression tree, we had to apply a threshold to the outputs of the neural network. The variation of this threshold created the lower dashed line of Fig. 4.

5 Hidden Markov Models

Two one-state models were created for each phonetic segment.
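The thresholding step described in the neural network section can be sketched as follows: each segment's predicted probability of misalignment is compared with a threshold, and sweeping that threshold yields one operating point per value, which is how a curve such as the dashed line in Fig. 4 could be traced. This is only a minimal sketch. The excerpt does not say which quantities Fig. 4 plots, so the detection rate and false alarm rate used here are assumptions, as are the function name `sweep_threshold` and the sample values.

```python
from typing import List, Tuple


def sweep_threshold(
    probs: List[float], labels: List[int], thresholds: List[float]
) -> List[Tuple[float, float, float]]:
    """Return (threshold, detection_rate, false_alarm_rate) per threshold.

    probs  -- hypothetical classifier outputs, P(misaligned) per segment
    labels -- 1 if the segment is truly misaligned, 0 otherwise
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    curve = []
    for t in thresholds:
        # A segment is flagged as misaligned when its probability reaches t.
        flagged = [p >= t for p in probs]
        true_pos = sum(1 for f, y in zip(flagged, labels) if f and y == 1)
        false_pos = sum(1 for f, y in zip(flagged, labels) if f and y == 0)
        detection = true_pos / positives if positives else 0.0
        false_alarm = false_pos / negatives if negatives else 0.0
        curve.append((t, detection, false_alarm))
    return curve


# Made-up values for illustration only (not data from the paper).
toy_probs = [0.9, 0.8, 0.35, 0.6, 0.2, 0.1]
toy_labels = [1, 1, 1, 0, 0, 0]
toy_curve = sweep_threshold(toy_probs, toy_labels, [0.25, 0.5, 0.75])
```

Raising the threshold flags fewer segments, so both rates can only stay the same or fall; plotting one rate against the other for many thresholds gives the trade-off curve.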

