In this thesis we explore how knowledge management is performed in
open source projects. Open source projects are often perceived as informal,
even unmanaged. Still, they appear to manage knowledge acquisition
and sharing sufficiently well to successfully develop software in such
a distributed environment as the Internet. This thesis aims to explore
that apparent contradiction, and thus complement the currently limited
research in this field.
Open source software is such an integral part of the Internet that is it safe to say that the
Internet wouldn’t exist as we know it today without it. The Internet never would have
grown as fast and as dynamically as it did without open source programs such as BIND,
which controls the domain name system; Sendmail, which powers most e-mail servers;
INN, which runs many news servers; Major Domo, which runs many of the thousands of
mailing lists on the Internet; and of course the popular Apache Web server. One thing for
sure is that the Internet is a lot cheaper due to open source...
We present the S-Space Package, an open source framework for developing and evaluating word space algorithms. The package implements well-known word space algorithms, such as LSA, and provides a comprehensive set of matrix utilities and data structures for extending new or existing models. The package also includes word space benchmarks for evaluation. Both algorithms and libraries are designed for high concurrency and scalability. We demonstrate the efﬁciency of the reference implementations and also provide their results on six benchmarks. ...
This paper presents a general-purpose open source package for recognizing Textual Entailment. The system implements a collection of algorithms, providing a conﬁgurable framework to quickly set up a working environment to experiment with the RTE task. Fast prototyping of new solutions is also allowed by the possibility to extend its modular architecture. We present the tool as a useful resource to approach the Textual Entailment problem, as an instrument for didactic purposes, and as an opportunity to create a collaborative environment to promote research in the ﬁeld. ...
We describe an open-source toolkit for statistical machine translation whose novel contributions are (a) support for linguistically motivated factors, (b) confusion network decoding, and (c) efficient data formats for translation models and language models. In addition to the SMT decoder, the toolkit also includes a wide variety of tools for training, tuning and applying the system to many translation tasks.
We demonstrate an open-source natural language generation engine that produces descriptions of entities and classes in English and Greek from OWL ontologies that have been annotated with linguistic and user modeling information expressed in RDF . We also demonstrate an accompanying plug-in for the Prot´ g´ ontology editor, e e which can be used to create the ontology’s annotations and generate previews of the resulting texts by invoking the generation engine.
The use of open source softwares helps users learn the internal structure of numerical models. However, open source softwares usually require linking to auxiliary programs for data preprocessing, postprocessing, and visualization. The users also have to understand the problem thoroughly, and may at times modify the source code so that the model could better represent the natural mechanisms.
Part 1 of the book serial ebook SELinux open source security enhanced linux to part 2 of you will continue to learn about relevant issues such as: Type enforcement, ancillary policy statements, customizing SELinux policies.
We present a new open source toolkit for phrase-based and syntax-based machine translation. The toolkit supports several state-of-the-art models developed in statistical machine translation, including the phrase-based model, the hierachical phrase-based model, and various syntaxbased models. The key innovation provided by the toolkit is that the decoder can work with various grammars and offers different choices of decoding algrithms, such as phrase-based decoding, decoding as parsing/tree-parsing and forest-based decoding. ...
In this paper, we present a new collection of open-source software libraries that provides command line binary utilities and library classes and functions for compiling regular expression and context-sensitive rewrite rules into ﬁnite-state transducers, and for n-gram language modeling. The OpenGrm libraries use the OpenFst library to provide an efﬁcient encoding of grammars and general algorithms for building, modifying and applying models.
We describe Akamon, an open source toolkit for tree and forest-based statistical machine translation (Liu et al., 2006; Mi et al., 2008; Mi and Huang, 2008). Akamon implements all of the algorithms required for tree/forestto-string decoding using tree-to-string translation rules: multiple-thread forest-based decoding, n-gram language model integration, beam- and cube-pruning, k-best hypotheses extraction, and minimum error rate training.
We describe Joshua (Li et al., 2009a)1 , an open source toolkit for statistical machine translation. Joshua implements all of the algorithms required for translation via synchronous context free grammars (SCFGs): chart-parsing, n-gram language model integration, beam- and cubepruning, and k-best extraction. The toolkit also implements sufﬁx-array grammar extraction and minimum error rate training. It uses parallel and distributed computing techniques for scalability.
This paper presents recent extensions to Poliqarp, an open source tool for indexing and searching morphosyntactically annotated corpora, which turn it into a tool for indexing and searching certain kinds of treebanks, complementary to existing treebank search engines. In particular, the paper discusses the motivation for such a new tool, the extended query syntax of Poliqarp and implementation and efﬁciency issues.
In the world of non-proprietary NLP software the standard, and perhaps the best, HMM-based POS tagger is TnT (Brants, 2000). We argue here that some of the criticism aimed at HMM performance on languages with rich morphology should more properly be directed at TnT’s peculiar license, free but not open source, since it is those details of the implementation which are hidden from the user that hold the key for improved POS tagging across a wider variety of languages. We present HunPos1 , a free and open source (LGPL-licensed) alternative, which can be tuned by the user to fully...
Ebook SELinux open source security enhanced linux focus presents basic issues such as: Introducing SELinux, The SELinux security model, Installing and initially configuring SELinux,... Invite you to refer to capture details.
Module 2 present open source applications and licenses. Objective summary of this module: Understanding desktop, server, and mobile applications; introducing development languages and package management.
We present REGULUS, an Open Source environment which compiles typed unification grammars into context free grammar language models compatible with the Nuance Toolkit. The environment includes a large general unification grammar of English and corpus-based tools for creating efficient domainspecific recognisers from it. We will demo applications built using the system, including a speech translator and a command and control system for a simulated robotic domain, and show how the development environment can be used to edit and extend them. ...
OpenVPN is an open source software application that implements virtual private network (VPN) techniques for creating secure point-to-point or site-to-site connections in routed or bridged configurations and remote access facilities. It uses a custom security protocol that utilizes SSL/TLS for key exchange. It is capable of traversing network address translators (NATs) and firewalls. It was written by James Yonan and is published under the GNU General Public License (GPL)