Evaluation Methods in Medical Informatics, Second Edition is a heavily updated and revised volume based on editors Friedman and Wyatt's successful first edition. This book incorporates the solid foundation of evaluation theories, methods, and techniques laid out in the first edition, and builds on it to include case studies from real world situations. Designed as a guide for both the informatics novice and the seasoned professional seeking a comprehensive resource, this book explores information systems evaluation from the ground up.
The main goal of this paper is to propose automatic schemes for the translation paired comparison method. This method was proposed to precisely evaluate a speech translation system's capability. Furthermore, the method gives an objective evaluation result, i.e., a score of the Test of English for International Communication (TOEIC). The TOEIC score is used as a measure of one's speech translation capability. However, this method requires tremendous evaluation costs. Accordingly, automatization of this method is an important subject for study.
As described in this paper, we propose a new automatic evaluation method for machine translation using noun-phrase chunking. Our method correctly determines the matching words between two sentences using corresponding noun phrases. Moreover, our method determines the similarity between two sentences in terms of the noun-phrase order of appearance. Evaluation experiments were conducted to calculate the correlation among human judgments, along with the scores produced using automatic evaluation methods for MT outputs obtained from the 12 machine translation systems in NTCIR7.
In this paper, I discuss issues pertinent to the design of a task-based evaluation methodology for a spoken machine translation (MT) system processing human to human communication rather than human to machine communication. I claim that system mediated human to human communication requires new evaluation criteria and metrics based on goal complexity and the speaker's prioritization of goals. ystem
We present the results of an experiment on extending the automatic method of Machine Translation evaluation BLUE with statistical weights for lexical items, such as tf.idf scores. We show that this extension gives additional information about evaluated texts; in particular it allows us to measure translation Adequacy, which, for statistical MT systems, is often overestimated by the baseline BLEU method. The proposed model uses a single human reference translation, which increases the usability of the proposed method for practical purposes. ...
In every text, some words have frequency appearance and are considered as keywords because they have strong relationship with the subjects of their texts, these words frequencies change with time-series variation in a given period. However, in traditional text dealing methods and text search techniques, the importance of frequency change with time-series variation is not considered. Therefore, traditional methods could not correctly determine index of word’s popularity in a given period.
Tuyển tập báo cáo các nghiên cứu khoa học quốc tế ngành y học dành cho các bạn tham khảo đề tài: EQUIP: Implementing chronic care principles and applying formative evaluation methods to improve care for schizophrenia: QUERI Series
Tuyển tập báo cáo các nghiên cứu khoa học quốc tế ngành hóa học dành cho các bạn yêu hóa học tham khảo đề tài:The role of interleukin-12 in the heavy metal-elicited immunomodulation: relevance of various evaluation methods
In this paper a novel solution to automatic and unsupervised word sense induction (WSI) is introduced. It represents an instantiation of the ‘one sense per collocation’ observation (Gale et al., 1992). Like most existing approaches it utilizes clustering of word co-occurrences. This approach differs from other approaches to WSI in that it enhances the effect of the one sense per collocation observation by using triplets of words instead of pairs. The combination with a two-step clustering process using sentence co-occurrences as features allows for accurate results.
A major focus of current work in distributional models of semantics is to construct phrase representations compositionally from word representations. However, the syntactic contexts which are modelled are usually severely limited, a fact which is reﬂected in the lexical-level WSD-like evaluation methods used.
Tuyển tập báo cáo các nghiên cứu khoa học quốc tế ngành hóa học dành cho các bạn yêu hóa học tham khảo đề tài: Research Article Video Summarization Based on Camera Motion and a Subjective Evaluation Method
The use of ﬁgurative language is ubiquitous in natural language texts and it is a serious bottleneck in automatic text understanding. We address the problem of interpretation of logical metonymy, using a statistical method. Our approach originates from that of Lapata and Lascarides (2003), which generates a list of nondisambiguated interpretations with their likelihood derived from a corpus. We propose a novel sense-based representation of the interpretation of logical metonymy and a more thorough evaluation method than that of Lapata and Lascarides (2003). ...
To solve a problem of how to evaluate computer-produced summaries, a number of automatic and manual methods have been proposed. Manual methods evaluate summaries correctly, because humans evaluate them, but are costly. On the other hand, automatic methods, which use evaluation tools or programs, are low cost, although these methods cannot evaluate summaries as accurately as manual methods. In this paper, we investigate an automatic evaluation method that can reduce the errors of traditional automatic methods by using several evaluation results obtained manually. ...
In this paper we describe two new objective automatic evaluation methods for machine translation. The first method is based on longest common subsequence between a candidate translation and a set of reference translations. Longest common subsequence takes into account sentence level structure similarity naturally and identifies longest co-occurring insequence n-grams automatically.
Time Series Forecasting – Part I presents about What is a Time Series? Components of Time Series, Evaluation Methods of Forecast, Smoothing Methods of Time Series, Time series models, Components of a time series, Trend component.
Time management skill is very essential to students in their study or in other extracurricular
activities, and needs more serious concern. The object of the research was to evaluate time
management skill of the 3rd year students of FFL-HUT to see if it is good or not, because they have
been facing a large amount of time-consuming assignments. Data were gathered using two
methods: content analysis on requirements of ESP subjects, on relevant materials, and
questionnaire survey on selected 3rd year students of FFL-HUT....
Bài giảng Chapter 10: The Basics of Capital Budgeting: Evaluating Cash Flows provides about Overview and “vocabulary”, Methods (Payback, discounted payback; NPV; IRR, MIRR; Profitability Index), Unequal lives, Economic life.
Organic expanders play an important role in the lead–acid battery. This study presents the design of an electrochemical method which evaluates the influence of some commercial expanders on the performance of the negative plate. Examination is made of different commercial expander formulations which are prepared in the laboratory under carefully controlled conditions.