intTypePromotion=1
zunia.vn Tuyển sinh 2024 dành cho Gen-Z zunia.vn zunia.vn
ADSENSE

Báo cáo khoa học: "Toward Statistical Machine Translation without Parallel Corpora"

Chia sẻ: Nhung Nhung | Ngày: | Loại File: PDF | Số trang:11

52
lượt xem
3
download
 
  Download Vui lòng tải xuống để xem tài liệu đầy đủ

We estimate the parameters of a phrasebased statistical machine translation system from monolingual corpora instead of a bilingual parallel corpus. We extend existing research on bilingual lexicon induction to estimate both lexical and phrasal translation probabilities for MT-scale phrasetables. We propose a novel algorithm to estimate reordering probabilities from monolingual data. We report translation results for an end-to-end translation system using these monolingual features alone. Our method only requires monolingual corpora in source and target languages, a small bilingual dictionary, and a small bitext for tuning feature weights. In this paper, we examine an idealization where a phrase-table is...

Chủ đề:
Lưu

Nội dung Text: Báo cáo khoa học: "Toward Statistical Machine Translation without Parallel Corpora"

ADSENSE

CÓ THỂ BẠN MUỐN DOWNLOAD

 

Đồng bộ tài khoản
2=>2