intTypePromotion=1

Báo cáo khoa học: "Intelligent Selection of Language Model Training Data"

Chia sẻ: Hongdo_1 Hongdo_1 | Ngày: | Loại File: PDF | Số trang:5

0
41
lượt xem
2
download

Báo cáo khoa học: "Intelligent Selection of Language Model Training Data"

Mô tả tài liệu
  Download Vui lòng tải xuống để xem tài liệu đầy đủ

We address the problem of selecting nondomain-specific language model training data to build auxiliary language models for use in tasks such as machine translation. Our approach is based on comparing the cross-entropy, according to domainspecific and non-domain-specifc language models, for each sentence of the text source used to produce the latter language model. We show that this produces better language models, trained on less data, than both random data selection and two other previously proposed methods. ...

Chủ đề:
Lưu

Nội dung Text: Báo cáo khoa học: "Intelligent Selection of Language Model Training Data"

ADSENSE
ADSENSE

CÓ THỂ BẠN MUỐN DOWNLOAD

 

Đồng bộ tài khoản
2=>2