We present a status report about an ongoing research project in the field of (semi-)automatic terminology acquisition at the European Academy Bolzano. The main focus will be on encoding a text corpus, which serves as a basis for applying term extraction programq. The CATEx (C_omputer A_.ssisted Terminology E___~raction) project emerged from the need to support and improve, both qualitatively and quantitatively, the manual acquisition of terminological data.
Special issue paper PAR-3D-BLAST: A parallel tool for searching and aligning protein structures present a parallel tool, parallel 3D-BLAST (PAR- 3D-BLAST), which lists the similar structures to the query protein. Each protein in the result list has a structural similarity score and an alignment to the query structure. The presented tool is implemented to ﬁt both the standalone multi-core computers and clusters of multi-core nodes. The achieved speedup is linear and scalable.
While speech recognition systems have come a long way in the last thirty years, there is still room for improvement. Although readily available, these systems are sometimes inaccurate and insufficient. The research presented here outlines a technique called Distributed Listening which demonstrates noticeable improvements to existing speech recognition methods. The Distributed Listening architecture introduces the idea of multiple, parallel, yet physically separate automatic speech recognizers called listeners. Distributed Listening also uses a piece of middleware called an interpreter.
This paper presents an unsupervised learning approach to building a non-English (Arabic) stemmer. The stemming model is based on statistical machine translation and it uses an English stemmer and a small (10K sentences) parallel corpus as its sole training resources. No parallel text is needed after the training phase. Monolingual, unannotated text can be used to further improve the stemmer by allowing it to adapt to a desired domain or genre.
While paraphrasing is critical both for interpretation and generation of natural language, current systems use manual or semi-automatic methods to collect paraphrases. We present an unsupervised learning algorithm for identiﬁcation of paraphrases from a corpus of multiple English translations of the same source text. Our approach yields phrasal and single word lexical paraphrases as well as syntactic paraphrases.
We describe our experience with automatic alignment of sentences in parallel English-Chinese texts. Our report concerns three related topics: (1) progress on the HKUST English-Chinese Parallel Bilingual Corpus; (2) experiments addressing the applicability of Gale ~ Church's (1991) lengthbased statistical method to the task of alignment involving a non-Indo-European language; and (3) an improved statistical method that also incorporates domain-specific lexical cues.
This detailed guide for programmers, developers, and computer enthusiasts shows how to get the most from parallel port in any application or project. The Visual-Basic code and circuit designs include examples that use the new enhanced (EPP) and expanded (EPC) modes.An excellent resource for Visual Basic programmers looking to interface hardware through standard ports. Anyone designing hardware to work with a parallel port is well advised to add this book to their library.
There are other manifestations of greatness than to relieve suffering or to wreck an empire. Julius Csar
and John Howard are not the only heroes who have smiled upon the world. In the supreme adaptation of
means to an end there is a constant nobility, for neither ambition nor virtue is the essential of a perfect action.
How shall you contemplate with indifference the career of an artist whom genius or good guidance has
compelled to exercise his peculiar skill, to indulge his finer aptitudes? A masterly theft rises in its claim to
respect high above the reprobation of the moralist.
Tuyển tập báo cáo các nghiên cứu khoa học quốc tế ngành hóa học dành cho các bạn yêu hóa học tham khảo đề tài: Research Article An FPGA Implementation of a Parallelized MT19937 Uniform Random Number Generator
A parallel computing model for the numerical solution of the general 20 shallow water equations in conservative form has been developed, tested and implemented in the MPI parallel environment set up on a parallel computer with four 2.8 GHz CPUs in Institute of Mechanics, VAST. The model is based on a Godunov-type numerical scheme, which is devised for 20 unstructured computational meshes, and on a domain decomposition technique.
Translations from a parallel corpus implicitly deals with the granularity problem as ﬁner sense distinctions are only relevant as far as they are lexicalized in the target translations. It also facilitates the integration of WSD in multilingual applications such as multilingual Information Retrieval (IR) or Machine Translation (MT).
We present a FrameNet-based semantic role labeling system for Swedish text. As training data for the system, we used an annotated corpus that we produced by transferring FrameNet annotation from the English side to the Swedish side in a parallel corpus. In addition, we describe two frame element bracketing algorithms that are suitable when no robust constituent parsers are available. We evaluated the system on a part of the FrameNet example corpus that we translated manually, and obtained an accuracy score of 0.
This paper presents a tool for extracting multi-word expressions from corpora in Modern Greek, which is used together with a parallel concordancer to augment the lexicon of a rule-based machinetranslation system. The tool is part of a larger extraction system that relies, in turn, on a multilingual parser developed over the past decade in our laboratory. The paper reviews the various NLP modules and resources which enable the retrieval of Greek multi-word expressions and their translations: the Greek parser, its lexical database, the extraction and concordancing system. ...
In this paper, we study the problem of extracting technical paraphrases from a parallel software corpus, namely, a collection of duplicate bug reports. Paraphrase acquisition is a fundamental task in the emerging area of text mining for software engineering. Existing paraphrase extraction methods are not entirely suitable here due to the noisy nature of bug reports. We propose a number of techniques to address the noisy data problem.
We describe a grammarless method for simultaneously bracketing both halves of a parallel text and giving word alignments, assuming only a translation lexicon for the language pair. We introduce inversion-invariant transduction grammars which serve as generative models for parallel bilingual sentences with weak order constraints. Focusing on Wansduction grammars for bracketing, we formulate a normal form, and a stochastic version amenable to a maximum-likelihoodbracketing algorithm. Several extensions and experiments are discussed. ...
Dictionary lookup is a computational activity that can be greatly accelerated when performed on large amounts of text by a parallel computer such as the Connection Machine T M Computer (CM). Several algorithms for parallel dictionary lookup are discussed, including one that allows the CM to lookup words at a rate 450 times that of lookup on a Symbolics 3600 Lisp Machine.
Single processor supercomputers have achieved great speeds and have been pushing
hardware technology to the physical limit of chip manufacturing. But soon this trend
will come to an end, because there are physical and architectural bounds, which limit
the computational power that can be achieved with a single processor system. In this
book, we study advanced computer architectures that utilize parallelism via multiple
Designations used by companies to distinguish their products are often claimed as trademarks. In all instances where R&D is aware of a trademark claim, the product name appears in initial capital letters, in all capital letters, or in accordance with the vendor's capitalization preference. Readers should contact the appropriate companies for more complete information on trademarks and trademark registrations. All trademarks and registered trademarks in this book are the property of their respective holders....
This book grew of a third year optional course taught to electrical engineering students at South Bank Polytechnic.A parallel course on robot dynamics and control was taught by a colleague.For completeness,I have added here my own treatment of robots is,however,a very large subject area,which really reqires a book of its own.Many such texts already
When you have a question about C# 5.0 or the .NET CLR, this bestselling guide has precisely the answers you need. Uniquely organized around concepts and use cases, this updated fifth edition features a reorganized section on concurrency, threading, and parallel programming - including in-depth coverage of C# 5.0's new asynchronous functions.