zunia.vn

Syntactic annotation

Showing 1-20 of 49 results for Syntactic annotation

Adapting word order transformation for Vietnamese dependency parsing

Dependency parsing, which is the task of automatically doing syntactic analysis and defining the binary dependencies between words in a sentence, has gained much attention from researchers in recent years. Besides that, to build effective and robust dependency parsers, we also need a large number of annotated treebanks to train the models. However, constructing such treebanks is complicated and requires considerable human effort.

8p visherylsandberg 18-05-2022 11 2 Download

Coreference annotation and resolution in the Colorado Richly Annotated Full Text (CRAFT) corpus of biomedical journal articles

Coreference resolution is the task of finding strings in text that have the same referent as other strings. Failures of coreference resolution are a common cause of false negatives in information extraction from the scientific literature.

14p viflorida2711 30-10-2020 9 2 Download

Scientific report: "A Structured Language Model"

The paper presents a language model that develops syntactic structure and uses it to extract meaningful information from the word history, thus enabling the use of long distance dependencies. The model assigns probability to every joint sequence of words-binary-parse-structure with headword annotation. The model, its probabilistic parametrization, and a set of experiments meant to evaluate its predictive power are presented.

3p bunthai_1 06-05-2013 34 1 Download
Scientific report: "A DOP Model for Semantic Interpretation"

In data-oriented language processing, an annotated language corpus is used as a stochastic grammar. The most probable analysis of a new sentence is constructed by combining fragments from the corpus in the most probable way. This approach has been successfully used for syntactic analysis, using corpora with syntactic annotations such as the Penn Tree-bank. If a corpus with semantically annotated sentences is used, the same approach can also generate the most probable semantic interpretation of an input sentence. The present paper explains this semantic interpretation method. ...

9p bunthai_1 06-05-2013 51 3 Download
Scientific report: "A platform for collaborative semantic annotation"

Data-driven approaches in computational semantics are not common because there are only few semantically annotated resources available. We are building a large corpus of public-domain English texts and annotate them semi-automatically with syntactic structures (derivations in Combinatory Categorial Grammar) and semantic representations (Discourse Representation Structures), including events, thematic roles, named entities, anaphora, scope, and rhetorical structure. We have created a wiki-like Web-based platform on which a crowd of expert annotators (i.e.

5p bunthai_1 06-05-2013 50 2 Download
Scientific report: "Framework of Semantic Role Assignment based on Extended Lexical Conceptual Structure: Comparison with VerbNet and FrameNet"

Widely accepted resources for semantic parsing, such as PropBank and FrameNet, are not perfect as a semantic role labeling framework. Their semantic roles are not strictly defined; therefore, their meanings and semantic characteristics are unclear. In addition, it is presupposed that a single semantic role is assigned to each syntactic argument. This is not necessarily true when we consider internal structures of verb semantics. We propose a new framework for semantic role annotation which solves these problems by extending the theory of lexical conceptual structure (LCS). ...

10p bunthai_1 06-05-2013 44 2 Download
Scientific report: "Feature-Rich Part-of-speech Tagging for Morphologically Complex Languages: Application to Bulgarian"

We present experiments with part-of-speech tagging for Bulgarian, a Slavic language with rich inflectional and derivational morphology. Unlike most previous work, which has used a small number of grammatical categories, we work with 680 morpho-syntactic tags. We combine a large morphological lexicon with prior linguistic knowledge and guided learning from a POS-annotated corpus, achieving accuracy of 97.98%, which is a significant improvement over the state-of-the-art for Bulgarian.

11p bunthai_1 06-05-2013 47 3 Download
Scientific report: "Empirical evaluations of animacy annotation"

This article presents empirical evaluations of aspects of annotation for the linguistic property of animacy in Swedish, ranging from manual human annotation and automatic classification to an external evaluation in the task of syntactic parsing.

9p bunthai_1 06-05-2013 55 2 Download
Scientific report: "Automatic Single-Document Key Fact Extraction from Newswire Articles"

This paper addresses the problem of extracting the most important facts from a news article. Our approach uses syntactic, semantic, and general statistical features to identify the most important sentences in a document. The importance of the individual features is estimated using generalized iterative scaling methods trained on an annotated newswire corpus.

9p bunthai_1 06-05-2013 29 1 Download
Scientific report: "Clique-Based Clustering for improving Named Entity Recognition systems"

We propose a system which builds, in a semi-supervised manner, a resource that aims at helping a NER system to annotate corpus-specific named entities. This system is based on a distributional approach which uses syntactic dependencies for measuring similarities between named entities. The specificity of the presented method, however, is to combine a clique-based approach and a clustering technique that amounts to a soft clustering method.

9p bunthai_1 06-05-2013 35 2 Download
Scientific report: "Parsing Arabic Dialects"

The Arabic language is a collection of spoken dialects with important phonological, morphological, lexical, and syntactic differences, along with a standard written language, Modern Standard Arabic (MSA). Since the spoken dialects are not officially written, it is very costly to obtain adequate corpora to use for training dialect NLP tools such as parsers. In this paper, we address the problem of parsing transcribed spoken Levantine Arabic (LA). We do not assume the existence of any annotated LA corpus (except for development and testing), nor of a parallel LA-MSA corpus. ...

8p bunthai_1 06-05-2013 44 2 Download
Scientific report: "Finite Structure Query: A Tool for Querying Syntactically Annotated Corpora"

In recent years large amounts of electronic texts have become available providing a new base for empirical studies in linguistics and offering a chance to linguists to compare their theories with large amounts of utterances from "the real world". While tagging with morphosyntactic categories has become a standard for almost all corpora, more and more of them are nowadays annotated with refined syntactic information. Examples are the Penn Treebank (Marcus et al.

8p bunthai_1 06-05-2013 48 2 Download
Scientific report: "Arabic Syntactic Trees: from Constituency to Dependency"

This research note reports on the work in progress which regards automatic transformation of phrase-structure syntactic trees of Arabic into dependency-driven analytical ones. Guidelines for these descriptions have been developed at the Linguistic Data Consortium, University of Pennsylvania, and at the Faculty of Mathematics and Physics and the Faculty of Arts, Charles University in Prague, respectively.

4p bunthai_1 06-05-2013 43 1 Download
Scientific report: "Manually Annotated Hungarian Corpus"

The current paper presents the results of a two-year project during which a consortium of the University of Szeged and MorphoLogic Ltd. Budapest developed a morpho-syntactically parsed and annotated (disambiguated) corpus for Hungarian. For morpho-syntactic encoding, the Hungarian version of MSD (MorphoSyntactic Description) has been used. The corpus contains texts from five different topic areas: schoolchildren's compositions, fiction, computer-related texts, news, and legal texts. During annotation, linguists have checked the morphosyntactic parsing of each word. ...

4p bunthai_1 06-05-2013 48 2 Download
Scientific report: "RIGHT ASSOCIATION REVISITED"

Consideration of when Right Association works and when it fails leads to a restatement of this parsing principle in terms of the notion of heaviness. A computational investigation of a syntactically annotated corpus provides evidence for this proposal and suggests circumstances when RA is likely to make correct attachment predictions.

3p bunmoc_1 20-04-2013 40 1 Download
Scientific report: "A Common Framework for Syntactic Annotation"

It is widely recognized that the proliferation of annotation schemes runs counter to the need to re-use language resources, and that standards for linguistic annotation are becoming increasingly mandatory. To answer this need, we have developed a representation framework comprised of an abstract model for a variety of different annotation types (e.g., morpho-syntactic tagging, syntactic annotation, co-reference annotation, etc.), which can be instantiated in different ways depending on the annotator's approach and goals. ...

8p bunrieu_1 18-04-2013 41 2 Download
Scientific report: "Automatic Labeling of Semantic Roles"

We present a system for identifying the semantic relationships, or semantic roles, filled by constituents of a sentence within a semantic frame. Various lexical and syntactic features are derived from parse trees and used to derive statistical classifiers from hand-annotated training data.

9p bunrieu_1 18-04-2013 44 3 Download
Scientific report: "The FrameNet Data and Software"

The FrameNet project has developed a lexical knowledge base providing a unique level of detail as to the possible syntactic realizations of the specific semantic roles evoked by each predicator, for roughly 7,000 lexical units, on the basis of annotating more than 100,000 example sentences extracted from corpora. An interim version of the FrameNet data was released in October 2002 and is being widely used.

4p bunbo_1 17-04-2013 42 2 Download
Scientific report: "Combining Lexical, Syntactic, and Semantic Features with Maximum Entropy Models for Extracting Relations"

Extracting semantic relationships between entities is challenging because of a paucity of annotated data and the errors induced by entity detection modules. We employ Maximum Entropy models to combine diverse lexical, syntactic and semantic features derived from the text. Our system obtained competitive results in the Automatic Content Extraction (ACE) evaluation. Here we present our general approach and describe our ACE results.

4p bunbo_1 17-04-2013 49 1 Download
Scientific report: "Using linguistic principles to recover empty categories"

This paper describes an algorithm for detecting empty nodes in the Penn Treebank (Marcus et al., 1993), finding their antecedents, and assigning them function tags, without access to lexical information such as valency. Unlike previous approaches to this task, the current method is not corpus-based, but rather makes use of the principles of early Government-Binding theory (Chomsky, 1981), the syntactic theory that underlies the annotation.

8p bunbo_1 17-04-2013 41 2 Download
