Scientific report: "Supervised Grammar Induction using Training Data with Limited Constituent Information"

Shared by: Nhung Nhung | File type: PDF | Pages: 7

Corpus-based grammar induction generally relies on hand-parsed training data to learn the structure of the language. Unfortunately, building large annotated corpora is prohibitively expensive. This work aims to improve the induction strategy when the training data contain few labels. We show that the most informative linguistic constituents are the higher nodes in the parse trees, typically denoting complex noun phrases and sentential clauses. They account for only 20% of all constituents. ...
