Chinese input is one of the key challenges for Chinese PC users. This paper proposes a statistical approach to Pinyin-based Chinese input. This approach uses a trigram-based language model and a statistically based segmentation. Also, to deal with real input, it also includes a typing model which enables spelling correction in sentence-based Pinyin input, and a spelling model for English which enables modeless Pinyin input.
Tuyển tập những bài báo cáo nghiên cứu khoa học hay nhất được đăng trên tạp chí JOURNAL OF FOREST SCIENCE đề tài: Multivariate statistical approach to comparison of the nutrient status of Norway spruce (Picea abies [L.] Karst.) and top-soil properties in differently managed forest stands...
Tuyển tập báo cáo các nghiên cứu khoa học quốc tế ngành hóa học dành cho các bạn yêu hóa học tham khảo đề tài: Editorial Network Structure and Biological Function: Reconstruction, Modeling, and Statistical Approaches
Tuyển tập các báo cáo nghiên cứu về y học được đăng trên tạp chí y học quốc tế cung cấp cho các bạn kiến thức về ngành y đề tài: "A statistical approach to estimating the strength of cell-cell interactions under the differential adhesion hypothesis
This paper describes an all level approach on statistical natural language translation (SNLT). W i t h o u t any predefined knowledge the system learns a statistical translation lexicon (STL), word classes (WCs) and translation rules (TRs) from a parallel corpus thereby producing a generalized form of a word alignment (WA). The translation process itself is realized as a beam search.
Tuyển tập các báo cáo nghiên cứu về y học được đăng trên tạp chí y học Wertheim cung cấp cho các bạn kiến thức về ngành y đề tài: A statistical approach for detecting genomic aberrations in heterogeneous tumor samples from single nucleotide polymorphism genotyping data...
Tuyển tập các báo cáo nghiên cứu về y học được đăng trên tạp chí y học Wertheim cung cấp cho các bạn kiến thức về ngành y đề tài: Computational and statistical approaches to analyzing variants identified by exome sequencing...
In this paper, we present a statistical approach for dialogue act processing in the dialogue component of the speech-to-speech translation system VERBMOBIL. Statistics in dialogue processing is used to predict follow-up dialogue acts. As an application example we show how it supports repair when unexpected dialogue states occur.
We present a natural language interface system which is based entirely on trained statistical models. The system consists of three stages of processing: parsing, semantic interpretation, and discourse. Each of these stages is modeled as a statistical process. The models are fully integrated, resulting in an end-to-end system that maps input utterances into meaning representation frames.
Combines a cookbook approach with the use of PCs and programmable calculators. Contains statistics suitable for the low number of samples, high-pressure situations commonly found in established analytical methods with algorithms to eliminate statistical table handling, sample programs and data sets th
When sitting in statistics classes or when trying to read and understand
statistical material, too many otherwise intelligent and capable students and
researchers feel dumb. This book is intended as an antidote. It is designed to
make you feel smart and competent. Its approach is conservative in that it
attempts to identify and present the essentials of data analysis as developed by
statisticians over the last two or three centuries.
In addition to covering statistical methods, most of the existing books on
equating also focus on the practice of equating, the implications of test development
and test use for equating practice and policies, and the daily equating challenges
that need to be solved. In some sense, the scope of this book is narrower than of
other existing books: to view the equating and linking process as a statistical
FACTORS THAT INFLUENCE THE DECENTRALIZATION OF THE INFORMATION SYSTEMS UNIT IN ORGANIZATIONS : A CONTIGENCY APPROACH Figure 1.2 displays the average allocation of school effectiveness in markets with
three and ten equally-sized districts. Panel A depicts the case where parents are unconcerned
about the peer group, as in the left-hand panels of Figure 1.1. Here, families must be
perfectly sorted on school effectiveness in equilibrium, and the average μ s depicted in the
figure are simply order statistics from the standard normal distribution....
Spelling correction for keyword-search queries is challenging in restricted domains such as personal email (or desktop) search, due to the scarcity of query logs, and due to the specialized nature of the domain. For that task, this paper presents an algorithm that is based on statistics from the corpus data (rather than the query log). This algorithm, which employs a simple graph-based approach, can incorporate different types of data sources with different levels of reliability (e.g., email subject vs.
We present a set of algorithms that enable us to translate natural language sentences by exploiting both a translation memory and a statistical-based translation model. Our results show that an automatically derived translation memory can be used within a statistical framework to often ﬁnd translations of higher probability than those found using solely a statistical model.
In this paper, we present a novel approach which incorporates the web-derived selectional preferences to improve statistical dependency parsing. Conventional selectional preference learning methods have usually focused on word-to-class relations, e.g., a verb selects as its subject a given nominal class.
Inspired by previous preprocessing approaches to SMT, this paper proposes a novel, probabilistic approach to reordering which combines the merits of syntax and phrase-based SMT. Given a source sentence and its parse tree, our method generates, by tree operations, an n-best list of reordered inputs, which are then fed to standard phrase-based decoder to produce the optimal translation. Experiments show that, for the NIST MT-05 task of Chinese-toEnglish translation, the proposal leads to BLEU improvement of 1.56%. ...
Often, the training procedure for statistical machine translation models is based on maximum likelihood or related criteria. A general problem of this approach is that there is only a loose relation to the ﬁnal translation quality on unseen text. In this paper, we analyze various training criteria which directly optimize translation quality. These training criteria make use of recently proposed automatic evaluation metrics.