Counting algorithms

In this paper, we will present an efficient method to compute the cooccurrence counts of any pair of substring in a parallel corpus, and an algorithm that make use of these counts to create subsentential alignments on such a corpus. This algorithm has the advantage of being as general as possible regarding the segmentation of text.
6p hongvang_1 16042013 11 1 Download

Báo cáo khoa học: "Semantic Classification of Noun Phrases Using Web Counts and Learning Algorithms"
This paper investigates the use of machine learning algorithms to label modifiernoun compounds with a semantic relation. The attributes used as input to the learning algorithms are the web frequencies for phrases containing the modifier, noun, and a prepositional joining term. We compare and evaluate different algorithms and different joining phrases on Nastase and Szpakowicz’s (2003) dataset of 600 modifiernoun compounds.
6p hongvang_1 16042013 17 1 Download

Recent text and speech processing applications such as speech mining raise new and more general problems related to the construction of language models. We present and describe in detail several new and efﬁcient algorithms to address these more general problems and report experimental results demonstrating their usefulness.
8p bunbo_1 17042013 13 1 Download

Lecture Algorithm design  Chapter 5: Divide and conquer I include all of the following: Mergesort, counting inversions, closest pair of points, randomized quicksort, median and selection. For more details, inviting you refer to the above lesson.
59p youcanletgo_03 14012016 4 1 Download

In this topic, we will look at: Justification for analysis, quadratic and polynomial growth, counting machine instructions, Landau symbols, BigQ as an equivalence relation, littleo as a weak ordering.
55p allbymyself_08 22022016 5 1 Download

During the last three decades, public academic research in cryptography has exploded. While classical cryptography has been long used by ordinary people, computer cryptography was the exclusive domain of the world’s militaries since the World War II. Today, stateof theart computer cryptography is practiced outside the secured walls of the military agencies. The laypersons can now employ security practices that can protect against the most powerful adversaries.
256p qsczaxewd 25092012 25 9 Download

Let us introduce the issues involved in cryptography. First, we need to define some terms; then we give some taxonomies.
54p trada85 22012013 23 4 Download

There is no fundamental reason that a transaction must abort as aresultofanynondeterministicevent;whensystemsdochoose to abort transactions due to outside events, it is due to practical consideration. After all, forcing all other nodes in a system to wait for the node that experienced a nondeterministic event (such as a hardware failure) to recover could bring a system to a painfully long standstill.
22p yasuyidol 02042013 19 4 Download

Tuyển tập các báo cáo nghiên cứu về hóa học được đăng trên tạp chí sinh học đề tài : Development and evaluation of a clinical algorithm to monitor patients on antiretrovirals in resourcelimited settings using adherence, clinical and CD4 cell count criteria
10p sting02 16012012 21 2 Download

Once an object has been instantiated, we can use the dot operator to invoke its methods title.length(); Note: A method may return a value or not Example: String s = new String(“Hello"); int count = s.length(); System.out.println("Length of s is " + count);
63p batman_1 10012013 21 2 Download

MORE ON VOLTAGEPROCESSING TECHNIQUES 14.1 COMPARISON OF DIFFERENT VOLTAGE LEASTSQUARES ALGORITHM TECHNIQUES Table 14.11 gives a comparison for the computer requirements for the different voltage techniques discussed in the previous chapter. The comparison includes the computer requirements needed when using the normal equations given by (4.130) with the optimum leastsquares weight W given by (4.132). Table 14.
15p khinhkha 30072010 49 13 Download

A routing algorithm constructs routing tables to forward communication packets based on network status information. Rapid inflation of the Internet increases demand for scalable and adaptive network routing algorithms. Conventional protocols such as the Routing Information Protocol (RIP) (Hedrick, 1988) and the Open ShortestPath First protocol (OSPF) (Comer, 1995) are not adaptive algorithms; they because they only rely on hop count metrics to calculate shortest paths. In large networks, it is difficult to realize an adaptive algorithm based on conventional approaches. ...
16p tienvovan 11092010 41 5 Download

For small N one does better to use an algorithm whose operation count goes as a higher, i.e., poorer, power of N , if the constant in front is small enough. For N
3p babyuni 17082010 32 4 Download

For a given category, choose a small set of exemplars (or 'seed words') 2. Count cooccurrence of words and seed words within a corpus 3. Use a figure of merit based upon these counts to select new seed words 4. Return to step 2 and iterate n times 5. Use a figure of merit to rank words for category membership and o u t p u t a ranked list Our algorithm uses roughly this same generic structure, but achieves notably superior results, by changing the specifics of: what counts as cooccurrence; which figures of merit to use for...
7p bunrieu_1 18042013 17 2 Download

RIP version 2 is not a new protocol—it is RIP Version 1 with some additional fields in the route update packet, key among them being subnet mask information in each route entry. The underlying DV algorithms in RIP2 are identical to those in RIP1, implying that RIP2 still suffers from convergence problems and the maximum hop count limit of 16 hops.
13p dienham63 14122015 9 2 Download

Traditional wisdom holds that once documents are turned into bagofwords (unigram count) vectors, word orders are completely lost. We introduce an approach that, perhaps surprisingly, is able to learn a bigram language model from a set of bagofwords documents. At its heart, our approach is an EM algorithm that seeks a model which maximizes the regularized marginal likelihood of the bagofwords documents.
9p hongphan_1 15042013 16 1 Download

Frequency counts from very large corpora, such as the Web 1T dataset, have recently become available for language modeling. Omission of low frequency ngram counts is a practical necessity for datasets of this size. Naive implementations of standard smoothing methods do not realize the full potential of such large datasets with missing counts.
4p hongphan_1 15042013 16 1 Download

As the first step in an automated text summarization algorithm, this work presents a new method for automatically identifying the central ideas in a text based on a knowledgebased concept counting paradigm. To represent and generalize concepts, we use the hierarchical concept taxonomy WordNet. By setting appropriate cutoff values for such parameters as concept generality and childtoparent frequency ratio, we control the amount and level of generality of concepts extracted from the text.
3p bunmoc_1 20042013 16 1 Download

An important goal of computational linguistics has been to use linguistic theory to guide the construction of computationally efficient realworld natural language processing systems. At first glance, generalized phrase structure grammar (GPSG) appears to be a blessing on two counts. First, the precise formalisms of GPSG might be a direct and fransparent guide for parser design and implementation. Second, since GPSG has weak contextfree generative power and contextfree languages can be parsed in O(n ~) by a wide range of algorithms, GPSG parsers would appear to run in polynomial time.
10p bungio_1 03052013 19 1 Download

We explore learning prepositionalphrase attachment in Dutch, to use it as a filter in prosodic phrasing. From a syntactic treebank of spoken Dutch we extract instances of the attachment of prepositional phrases to either a governing verb or noun. Using crossvalidated parameter and feature selection, we train two learning algorithms, TB I and RIPPER, 011 making this distinction, based on unigram and bigram lexical features and a cooccurrence feature derived from WWW counts.
8p bunthai_1 06052013 17 1 Download