Chương 1: Các khái niệm cơ bản MIRS Issues

Nguyễn Thị Oanh Bộ môn HTTT – Viện CNTT & TT oanhnt@soict.hut.edu.vn

1

Architecture

 Main operations – Insert new item – Retrieval

2

Main operations

3

Data Model

 Data model determines :

 Requirements: DM should be

– How information is organized and stored – What types of queries are supported

relationships

– extensible, new data type can be added – able to represent basic media type + temporal, spatial

at different levels of abstraction

– flexible so that items can be specified, queried, searched

4

– allow efficient storage and search

A General Multimedia Data Model

 OO-based  Multiple layers: – Object layer

 Spatial relationship: window size + position for each item  Temporal specification: timeline-based: start time + duration

 Common media type  Features or attributes for each media type are specified  ex.: image: size, color histogram, main objects contained

– Media type layer:

 Specifies the media formats   for proper encoding, analysis & presentation

5

– Media format layer

A General Multimedia Data Model

6

Data Model: Remaining issues

 Each layer:

 Most MIRS: application-specific

– Not completely designed – No common standard

– Limited number of features – Limited number of data type

- VIMSYS: image + video - a general video model - virage image schema structure

7

Special data model for each application:

Data Model: Example

 VIMSYS (Visual Information Management System)

Define events that can be queried

User-defined entity:

1 concept (sunset ) or a physical entity (heart)

Segmentation layer (temporal, spatial info, ..) + Feature layer: histogram, texture

Data + transformation (compression, color space conversion, image enhancement, …

8

User interface

 Requirements:

allow user to :

– Insert database items easily – effectively and effeciently enter queries – Present query results to the user effectively and efficiently – Be user-friendly

9

– specify various types of input – compose multimedia objects – Specify attribute types to be extracted and indexed

User interface – Query support

 Multimedia query:

– Diverse – Fuzzy ==> tools:

 By keywords, parameters  mapping problem: « red car »  By example  need input tools: microphone, camera, …

– Searching:

 A very large query  Based on the DB organization  Item randomly chosen

– Browsing: start browsing with

10

– Query refinement: Feedback

User interface – Result presentation

 Many design issues:

+ QoS

– Present all media types + temporal, spatial relationships

long audio segment,

information to large

long video,

browse for: image…

– How to extract and present essential

11

– Reponse time should be short (communication subsystem time + DB search time) – Felicitate relevance feedback and query refinement

Feature extraction

 Determine the retrieval effectiveness  Requirements:

information items

– complete as possible to represent the contents of the

12

– represented and stored compactly – The computation of distance between features should be efficient, otherwise the system response time would be too long

Feature extraction

Levels of feature

Example

Handling Techniques

Meta data

DBMS

Author name, date, title, …

Information Retrieval

Text annotation (captures abstract concepts)

Content description, keywords: happy, sad, …

Content-based retrieval

Audio: frequency distribution, … Image: color distributions, texture,shapes, …

Low-level (data patterns and statistics of a multimedia object, and possibly spatial and temporal relations between parts of the object)

recognize and interpret humans

Content-based retrieval

High-level (attempts to recognize and understand objects)

13

Indexing

 1 Object ~ many features  1 feature ~ many parameters

 Indexing in MIRSs should

 Application classification  Different levels of features  spatial and temporal relationships between objects

14

– be hierarchical and – take place at multiple levels.

Similarity measurement

 Similarity: computed on extracted features retrieval  Relevance of

results:

judged by human

(subjective and context dependent)

? Computed similarity values should be conform to

human judgement – Features used ? – Similarity measure used ?

15

QoS (Quality-of-Service)

 MMData requires: – High bandwidth – large storage space and high transfer rate – delay and jitter bound – and temporal and spatial synchronization

key components:

the operating system – the storage manager – the transport or communications system

16

– hosts (including clients and servers) under the control of

Tổng kết

 Data model  User interface: query support + presentation  Feature extraction  Indexing  Similarity Measurement  Storage

17

18