
Scientific report: "Chinese Unknown Word Identification Using Character-based Tagging and Chunking"

Shared by: Nhung Nhung | File type: PDF | Pages: 4


Document description

Because written Chinese has no spaces to delimit words, segmenting Chinese text is an essential task. During segmentation, the problem of unknown words arises: it is impossible to register every word in a dictionary, since new words can always be created by combining characters. We propose a unified solution for detecting unknown words in Chinese texts. First, morphological analysis produces an initial segmentation and POS tags; then a chunker is used to detect unknown words.
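
The two-stage idea in the description can be illustrated with a minimal Python sketch. It is not the authors' implementation: the position tags (B, I, E, S), the chunk labels (B-UNK, I-UNK, O), the POS tag names, and both function names are assumptions chosen for illustration, and the chunk labels are written by hand here rather than predicted by a trained chunker.

```python
from typing import List, Tuple


def words_to_char_tags(words: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Split (word, POS) pairs from an initial segmentation into characters.

    Each character gets a hypothetical tag "POS-position", where position is
    B (begin), I (inside), E (end), or S (single-character word).
    """
    tagged = []
    for word, pos in words:
        if len(word) == 1:
            tagged.append((word, f"{pos}-S"))
            continue
        last = len(word) - 1
        for i, ch in enumerate(word):
            position = "B" if i == 0 else "E" if i == last else "I"
            tagged.append((ch, f"{pos}-{position}"))
    return tagged


def chunks_to_unknown_words(chars: List[str], labels: List[str]) -> List[str]:
    """Collect character runs labelled B-UNK / I-UNK into word candidates."""
    found: List[str] = []
    current = ""
    for ch, label in zip(chars, labels):
        if label == "B-UNK":
            # A new chunk starts; close any chunk still open.
            if current:
                found.append(current)
            current = ch
        elif label == "I-UNK" and current:
            current += ch
        else:
            # "O" (or a stray I-UNK with no open chunk) ends the chunk.
            if current:
                found.append(current)
            current = ""
    if current:
        found.append(current)
    return found


# Stage 1 (assumed output of a morphological analyzer): words with POS tags.
initial = [("我", "PN"), ("喜欢", "VV"), ("北京", "NR")]
char_tags = words_to_char_tags(initial)

# Stage 2 (labels written by hand in place of a chunker's predictions).
chars = list("他叫王小明")
labels = ["O", "O", "B-UNK", "I-UNK", "I-UNK"]
unknown = chunks_to_unknown_words(chars, labels)
```

In this sketch the character-level tags from the first stage would serve as input features for the chunker, and the second function only shows how chunk labels map back to unknown word candidates.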
