  • Computer systems are undergoing a revolution. From 1945, when the modern computer era began, until about 1985, computers were large and expensive. Even minicomputers cost at least tens of thousands of dollars each. As a result, most organizations had only a handful of computers, and for lack of a way to connect them, these operated independently from one another. Starting around the mid-1980s, however, two advances in technology began to change that situation. The first was the development of powerful microprocessors....

  • Quantitative distributions of main ions and other chemical components of groundwater are characterized by their statistical parameters. These parameters depend closely on the probability distribution of the data.

  • An important chapter in the theory of distribution of zeros of entire functions pertains to the study of linear operators acting on entire functions. This dissertation presents new results involving not only linear, but also some non-linear operators.

  • Distributional similarity is a classic technique for entity set expansion, where the system is given a set of seed entities of a particular class, and is asked to expand the set using a corpus to obtain more entities of the same class as represented by the seeds. This paper shows that a machine learning model called positive and unlabeled learning (PU learning) can model the set expansion problem better. Based on the test results of 10 corpora, we show that a PU learning technique outperformed distributional similarity significantly. ...

  • Our research aims at building computational models of word meaning that are perceptually grounded. Using computer vision techniques, we build visual and multimodal distributional models and compare them to standard textual models. Our results show that, while visual models with state-of-the-art computer vision techniques perform worse than textual models in general tasks (accounting for semantic relatedness), they are as good or better models of the meaning of words with visual correlates such as color terms, even in a nontrivial task that involves nonliteral uses of such words. ...

  • In this paper we investigate how to automatically determine if two document collections are written from different perspectives. By perspective we mean a point of view, for example, that of Democrats or Republicans. We propose a test of different perspectives based on distribution divergence between the statistical models of two collections. Experimental results show that the test can successfully distinguish document collections of different perspectives from other types of collections. ...

  • We propose a distribution-based pruning of n-gram backoff language models. Instead of the conventional approach of pruning n-grams that are infrequent in training data, we prune n-grams that are likely to be infrequent in a new document. Our method is based on the n-gram distribution, i.e., the probability that an n-gram occurs in a new document. Experimental results show that our method performed 7-9% (word perplexity reduction) better than conventional cutoff methods.

  • We describe a novel method that extracts paraphrases from a bitext, for both the source and target languages. In order to reduce the search space, we decompose the phrase-table into sub-phrase-tables and construct separate clusters for source and target phrases. We convert the clusters into graphs, add smoothing/syntactic-information-carrier vertices, and compute the similarity between phrases with a random walk-based measure, the commute time.

  • With a few exceptions, discriminative training in statistical machine translation (SMT) has been content with tuning weights for large feature sets on small development data. Evidence from machine learning indicates that increasing the training sample size results in better prediction. The goal of this paper is to show that this common wisdom can also be brought to bear upon SMT.

  • We present an automatic approach to determining whether a pronoun in text refers to a preceding noun phrase or is instead non-referential. We extract the surrounding textual context of the pronoun and gather, from a large corpus, the distribution of words that occur within that context. We learn to reliably classify these distributions as representing either referential or non-referential pronoun instances. Despite its simplicity, experimental results on classifying the English pronoun "it" show the system achieves the highest performance yet attained on this important task. ...

  • A distributional method for part-of-speech induction is presented which, in contrast to most previous work, determines the part-of-speech distribution of syntactically ambiguous words without explicitly tagging the underlying text corpus. This is achieved by assuming that the word pair consisting of the left and right neighbor of a particular token is characteristic of the part of speech at this position, and by clustering the neighbor pairs on the basis of their middle words as observed in a large corpus.

  • A stochastic model based on insights of Mandelbrot (1953) and Simon (1955) is discussed against the background of new criteria of adequacy that have become available recently as a result of studies of the similarity relations between words as found in large computerized text corpora.

  • This paper presents results from experiments in automatic classification of animacy for Norwegian nouns using decision-tree classifiers. The method makes use of relative frequency measures for linguistically motivated morphosyntactic features extracted from an automatically annotated corpus of Norwegian. The classifiers are evaluated using leave-one-out training and testing. The initial results are promising (approaching 90% accuracy) for high-frequency nouns, but deteriorate gradually as lower-frequency nouns are classified.

  • CuJut District has favorable conditions for scaling up farms. The number of farms in the district has recently increased. However, fluctuations in the agricultural sector have resulted in a recession and unstable prices. The purpose of this research is to determine the opportunities and challenges of farm production within the market system, and to develop policies to encourage market growth.

  • Probability and statistics are concerned with events which occur by chance. Examples include occurrence of accidents, errors of measurements, production of defective and nondefective items from a production line, and various games of chance, such as drawing a card from a well-mixed deck, flipping a coin, or throwing a symmetrical six-sided die. In each case we may have some knowledge of the likelihood of various possible results, but we cannot predict with any certainty the outcome of any particular trial....

  • Important notice: Many practical techniques described in this book involve potentially dangerous applications of electricity and engineering equipment. The authors, editors and publishers cannot take responsibility for any personal, professional or financial risk involved in carrying out these techniques, or any resulting injury, accident or loss. The techniques described in this book should only be implemented by professional and fully qualified electrical engineers using their own professional judgement and due regard to health and safety issues....

  • This hands-on guide offers practical advice on all aspects of science communication. It features a tightly interwoven fabric of issues: product types, target groups, written communication, visual communication, validation processes, practices of efficient workflow, distribution, promotion, advertising, and much more. Extremely practical, the guide provides the necessary "shortcuts" to produce outreach products of high quality. All concepts are explained in simple terms and with illustrative examples, while checklists and short "to-the-point" overviews enable rapid progress and quick results.

  • As a result of the gloomy economic climate in Europe brought about by the ongoing crisis affecting all enterprises (particularly SMEs) in France, the French government has recently decided to create different poles of innovation and research associated with the activities of enterprises. The objective is to propose new ideas for helping these enterprises to be more efficient and able to resist the threat to their existence from globalisation. One difficulty these enterprises face is penetrating the European market and exporting their products.

  • Overview of Mass Storage Structure; Disk Structure; Disk Attachment; Disk Scheduling; Disk Management; Swap-Space Management; RAID Structure; Stable-Storage Implementation; Tertiary Storage Devices; Operating System Issues; Performance Issues. Describe the physical structure of secondary and tertiary storage devices and the resulting effects on the uses of the devices. Explain the performance characteristics of mass-storage devices. Discuss operating-system services provided for mass storage, including RAID and HSM. Overview of Mass Storage Structure...

  • The dramatic reduction in transmission loss of optical fibers, coupled with very important developments in the area of light sources and detectors, has resulted in phenomenal growth of the fiber optic industry during the last 35 years or so. Indeed, the birth of optical fiber communication systems coincided with the fabrication of low-loss optical fibers and the operation of room-temperature semiconductor lasers in 1970. Since then, scientific and technological growth in this field has been phenomenal.
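
A note on the n-gram pruning abstract listed above: the contrast it draws between pruning by raw training count and pruning by the probability that an n-gram occurs in a new document can be illustrated with a small Python sketch. This is not the authors' estimator. It assumes the simplest possible stand-in, the fraction of training documents that contain the n-gram at least once, and the function names and thresholds are hypothetical.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def keep_by_count(docs, n, min_count):
    """Conventional cutoff: keep n-grams whose total count reaches min_count."""
    counts = Counter()
    for tokens in docs:
        counts.update(ngrams(tokens, n))
    return {gram for gram, c in counts.items() if c >= min_count}


def keep_by_document_probability(docs, n, min_doc_prob):
    """Distribution-style cutoff (illustrative assumption, not the paper's method):
    keep n-grams whose estimated probability of appearing in a document,
    taken here as the fraction of training documents containing them,
    reaches min_doc_prob."""
    doc_freq = Counter()
    for tokens in docs:
        # set() so each document contributes at most once per n-gram
        doc_freq.update(set(ngrams(tokens, n)))
    total = len(docs)
    return {gram for gram, df in doc_freq.items() if df / total >= min_doc_prob}


if __name__ == "__main__":
    docs = [
        ["a", "b", "a", "b", "a", "b"],  # one document repeats ("a", "b")
        ["c", "d"],
        ["c", "d", "e"],
    ]
    print(keep_by_count(docs, 2, min_count=3))
    print(keep_by_document_probability(docs, 2, min_doc_prob=0.5))
```

In this toy corpus the bigram ("a", "b") is frequent overall but confined to a single document, so the count cutoff keeps it while the document-based cutoff drops it in favor of ("c", "d"), which appears in two of the three documents.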

