Lecture 3: Lecture 3: GIS Data Models GIS Data Models

To introduce various GIS data models AIM:AIM: To introduce various GIS data models

After this lecture you should be able to: After this lecture you should be able to:

Describe the primary Vector data models Describe the primary Vector data models used in GIS and give examples used in GIS and give examples Describe raster data models and give Describe raster data models and give examples examples Describe TIN data models Describe TIN data models Explain “topology” Explain “topology” Describe the main file formats used in GIS Describe the main file formats used in GIS

Graphic Features Graphic Features

Objects which are shown on a map are Objects which are shown on a map are called graphic features or features. called graphic features or features. These features may be natural or man These features may be natural or man made objects. made objects.

Digital representation of physical or man made Digital representation of physical or man made elements: elements:

Vectors Vectors  Points or Nodes Points or Nodes  Lines or Arcs Lines or Arcs  Polygons Polygons Raster Cells or Pixels Raster Cells or Pixels  Images Images  Digital Orthophotography Digital Orthophotography

Graphic Features Graphic Features

Images Images Vector Vector Raster (GRID) Raster (GRID) Attributes Attributes TIN ( Triangulated Irregular Network)) TIN (Triangulated Irregular Network Annotation Annotation

© Paul Bolstad, GIS Fundamentals

Geographic Data Types Geographic Data Types

© Paul Bolstad, GIS Fundamentals

Geographic Data Types Geographic Data Types

• RASTER

• VECTOR

• Real World

Source: Defense Mapping School National Imagery and Mapping Agency

DIGITAL SPATIAL DATA DIGITAL SPATIAL DATA

Electronic Maps in Raster and Electronic Maps in Raster and Vector Formats Vector Formats

Raster Data Sources Raster Data Sources

Air Photos

Satellite Imagery

Scanned Maps

A scanned photograph that

has been mathematically rectified to eliminate the

effects of displacement so

that its view always appears as though it is perpendicular

to the ground.

Digital Orthophotograph Digital Orthophotograph

Image Copyright 1993 Nassau County, NY

Scanned Scanned Document Document

Raster Maps (images) Raster Maps (images)

 A set of colored pixels A set of colored pixels representing chart representing chart information as a picture on information as a picture on computer screen computer screen

 Simply an array of pixels Simply an array of pixels arranged in rows and arranged in rows and columns columns

 Pixels are color coded, but Pixels are color coded, but do not represent features do not represent features explicitly explicitly

 Rasters can have value Rasters can have value attached attached

Source: Defense Mapping School National Imagery and Mapping Agency

The Appeal of Raster The Appeal of Raster

 Looks like a paper map Looks like a paper map

 Cheap and easy to produce (scan Cheap and easy to produce (scan existing paper chart) existing paper chart)

 Runs easily on PC Runs easily on PC

 World-wide availability w/updates World-wide availability w/updates available available

 Problems: Problems:

 Datum may or may not be WGS84 Datum may or may not be WGS84  Does not allow automated grounding Does not allow automated grounding avoidance or any other computer- avoidance or any other computer- aided analysis aided analysis  Storage not as efficient Storage not as efficient

Source: Defense Mapping School National Imagery and Mapping Agency

An intelligent form of digital An intelligent form of digital geographic data where real world geographic data where real world objects are represented by points, objects are represented by points, lines and polygons. An objects lines and polygons. An objects representation is described by representation is described by attributes and coordinates. attributes and coordinates.

Examples include: Examples include: Digitized Maps Digitized Maps GIS Data GIS Data

Source: Defense Mapping School National Imagery and Mapping Agency

Vector Data Vector Data

 In addition to

Linking Attributes and Linking Attributes and Graphics Graphics

eography, In addition to GGeography, GIS is also about GIS is also about IInformation. A GIS marries nformation. A GIS marries the capabilities of computer the capabilities of computer mapping with database mapping with database management systems. So, management systems. So, a geospatial analyst can a geospatial analyst can “click” on a geographic “click” on a geographic feature and find out feature and find out information about it. Or, information about it. Or, they can query the database they can query the database to find out information about to find out information about all the geographic objects all the geographic objects that meet a specific criteria. that meet a specific criteria.  Information about a Information about a geographic object is called geographic object is called attribute.. an an attribute

Linking Attributes with Graphics Linking Attributes with Graphics

 In this example, the In this example, the user has issued a user has issued a query to select all query to select all the properties the properties whose classification whose classification is single family is single family residential (210), residential (210), and colored the and colored the selection red. selection red.

1

5

Vector Data is Layered Vector Data is Layered

Features: Line Points:

2 • • 4

•Lake • Lake

Connected Detached

7

6

Area Text

• •

Spatial Data Analysis

30ł 30’ N 040ł 40’ E

30ł 30’ N 040ł 50’ E

Geographic data are separated into themes, tiled for speed of access and placed on CD-ROM Network Analysis

Source: Defense Mapping School National Imagery and Mapping Agency

Source: Defense Mapping School National Imagery and Mapping Agency

Paradigm Shift Paradigm Shift May look “different” from a May look “different” from a paper map (software- paper map (software- dependent) dependent) Underlying database allows Underlying database allows queries and layer selection queries and layer selection Zooming reveals detail Zooming reveals detail Complex, expensive, and Complex, expensive, and time-consuming to produce time-consuming to produce

Raster and Vector Data Models Raster and Vector Data Models

Real World

600

Trees

500

1 2 3 4 5 6 7 8 9 10 G G G

B B

400

Trees

Y-AXIS

G G

300

B B G G B B

BK

House

B

200

River

100

G G

200

500

100

300

600

B B B B

1 2 3 4 5 6 7 8 9 10

Raster Representation

400 X-AXIS Vector Representation

Source: Defense Mapping School National Imagery and Mapping Agency

A GIS in which graphic data is stored in the A GIS in which graphic data is stored in the form of discrete points, lines, or polygons. form of discrete points, lines, or polygons.

Vector GIS Vector GIS

© Paul Bolstad, GIS Fundamentals

 There is a relationship between vector data types. There is a relationship between vector data types. Each data type is often dependent upon one Each data type is often dependent upon one another. When representing geographic data in another. When representing geographic data in vector format, data is typically stored as: vector format, data is typically stored as:

 Points

Point

 Lines

Line

Polygon

Structure of Vector Data Structure of Vector Data

Points: points are zero dimensional objects, and : points are zero dimensional objects, and represent geographic features such as wells, represent geographic features such as wells, sample locations, or trees. sample locations, or trees. Lines: lines represent one dimensional objects, or : lines represent one dimensional objects, or linear features, such as road and stream linear features, such as road and stream centerlines. Lines are made up of a series of centerlines. Lines are made up of a series of interconnected points. A line typically starts and interconnected points. A line typically starts and , and the node, and the end with a special point called a node end with a special point called a points that make up the rest of a line are called points that make up the rest of a line are called vertices. . vertices  Polygons/Area Polygons/Area: polygons represent two : polygons represent two dimensional objects such as the boundaries of a dimensional objects such as the boundaries of a field, or property, or the outline of a building or lake. field, or property, or the outline of a building or lake. Polygons are made up of a series of connected Polygons are made up of a series of connected lines where the starting point of a polygon is the lines where the starting point of a polygon is the same as the ending point. same as the ending point.

Points are zero dimensional objects

which have locations and attribute information but are too small to be

represented as areas.

Points Points

Soil Samples Soil Samples  TypeType  PHPH  Contaminants Contaminants Utility Poles Utility Poles  Owner Owner  Height Height  Attachments Attachments

 Spill Locations Spill Locations  Accident Number Accident Number  Type of Spill Type of Spill  Extent Extent  Parcel Centroids Parcel Centroids  Section/Block/Lot No. Section/Block/Lot No.  Address Address  Owner Owner  Assessment Data Assessment Data

Points Points

Light Poles Light Poles

Parcel Centroids Parcel Centroids

Lines are one dimensional objects Lines are one dimensional objects which have length but no area. which have length but no area. Each line must begin and end at a Each line must begin and end at a node. node.

Lines or Arcs Lines or Arcs

Stream Stream  DepthDepth  Quality Quality  Flow Rate Flow Rate

Street Centerline Street Centerline  Street Name Street Name  Address Ranges Address Ranges Water Main Water Main  Pipe size Pipe size  Pipe Material Pipe Material  Date Installed Date Installed

Lines or Arcs Lines or Arcs

Street Centerlines Street Centerlines

Polygons are closed mathematical Polygons are closed mathematical figures of any shape or size. They are figures of any shape or size. They are formed by a series of connected lines. formed by a series of connected lines.

Polygons Polygons

Parcel Parcel Parcel ID Number Parcel ID Number Dimensions and Area Dimensions and Area Soil Boundaries Soil Boundaries TypeType Permeability Permeability Flood Zones Flood Zones

Polygons Polygons

Polygons Polygons

Polygons Polygons

Linked” Attribute Data ““Linked” Attribute Data

Tabular Data Tabular Data (Attributes) (Attributes)

© ESRI

Information which describes an entity Information which describes an entity represented by a graphic feature. represented by a graphic feature.

Attribute Data Attribute Data

Information about a graphic feature Each piece of attribute information is related Each piece of attribute information is related to a specific graphic element to a specific graphic element Traditional database type fields Traditional database type fields Images Images Photos Photos Video Video

Attribute Data Attribute Data

Linked” Attributes ““Linked” Attributes

Courtesy Village of Garden City

A GIS in which graphic data is stored in A GIS in which graphic data is stored in the form of grid cells or pixels. the form of grid cells or pixels.

Part 3: Raster GIS Part 3: Raster GIS

Geographic Data Sets Geographic Data Sets  Land Use/Land Cover Land Use/Land Cover  Vegetation Index Vegetation Index  Soil Stability Soil Stability Digital Photography Digital Photography  Building Photos Building Photos  Accident Scenes Accident Scenes  Crop Damage Crop Damage  Full Motion Video Full Motion Video Digital Orthophotography Digital Orthophotography  Rectified Aerial Photos Rectified Aerial Photos

Raster Data Raster Data

© ESRI, Modeling Our World

Abbreviation for PICTURE ELEMENT Abbreviation for

, which is the PICTURE ELEMENT, which is the

smallest unit in an image. In raster based GIS smallest unit in an image. In raster based GIS systems, attribute information can be assigned to systems, attribute information can be assigned to each pixel. each pixel.

PIXEL PIXEL

© ESRI, Modeling Our World

© ESRI, Modeling Our World

Matrix of Equal-Area Cells Matrix of Equal-Area Cells

Raster Data Structure Raster Data Structure

WHEAT

APPLES

WHEAT

APPLES

HAY

HAY

Issue of Resolution Issue of Resolution

Must Consider Must Consider Discreteness of the entity being depicted Discreteness of the entity being depicted Intended application (efficiencies) Intended application (efficiencies) Source data Source data Storage considerations Storage considerations  Resolution Resolution  ColorColor

Raster Vs. Vector Raster Vs. Vector

People often ask: “what is the best format to represent geographic objects in a GIS, raster of vector?” The answer is actually neither and both! That is, neither data model is better in every circumstance, and both data models fulfill very specific roles in GIS.

Shoreline Shoreline

Water Water

Land Land

In this example, the vector data source representing the shoreline appears to have greater detail, and possibly greater accuracy. Vector data can often store the information in a more compact format than raster data, and also work well with linear objects such as stream networks.

Vector Shoreline Vector Shoreline

However, raster data models are much better at representing information that is continuous in nature, such as temperature where the value of temperature may be different between neighbors.

Source: Defense Mapping School National Imagery and Mapping Agency

Raster vs. Raster vs. Vector Model Vector Model

Advantages Advantages  Good representation of reality Good representation of reality  Compact data structure Compact data structure  Topology can be described in a network Topology can be described in a network  Accurate graphics Accurate graphics Disadvantages Disadvantages  Complex data structures Complex data structures  Simulation may be difficult Simulation may be difficult  Some spatial analysis is difficult or impossible to Some spatial analysis is difficult or impossible to perform perform

Vector – Advantages and Vector – Advantages and Disadvantages Disadvantages

 Advantages Advantages  Simple data structure Simple data structure  Easy overlay Easy overlay  Various kinds of spatial analysis Various kinds of spatial analysis  Uniform size and shape Uniform size and shape  Cheaper technology Cheaper technology  Disadvantages Disadvantages  Large amount of data Large amount of data  Less “pretty” Less “pretty”  Projection transformation is difficult Projection transformation is difficult  Different scales between layers can be a nightmare Different scales between layers can be a nightmare  May lose information due to generalization May lose information due to generalization

Raster – Advantages and Raster – Advantages and Disadvantages Disadvantages

Raster Functions

© Paul Bolstad, GIS Fundamentals

Triangulated Irregular Network Triangulated Irregular Network

 A triangulated irregular network

(TIN) is a data model that is used to represent three dimensional objects. In this case, x,y, and z values represent points. Using methods of computational geometry, the points are connected into what is called a triangulation, forming a network of triangles. The lines of the triangles are called edges, and the interior area is called a face, or facet.  While the TIN model is somewhat

more complex than the simple point, line, and polygon vector model, or the raster model, it is actually quite useful for representing elevations. For example a raster grid would require grid cells to cover the entire surface of a geographic area. Also, if we wanted to show great detail we would have to have small grid cells. Now, if the land area is relatively flat, we would still need the small grid cells. However, with a TIN we would not have to include so many points on the flat areas, but could add more points on the steep areas where we want to show greater detail.

 The illustration shows how we can create a TIN of the terrain around Ithaca, NY.

 The tin data structure is The tin data structure is based on two basic elements: based on two basic elements:

 points with x,y,z values, and a points with x,y,z values, and a series of edges joining these points series of edges joining these points to form triangles. to form triangles.  This triangular mosaic forms a This triangular mosaic forms a continuous faceted surface, much continuous faceted surface, much like a jewel. like a jewel.  TIN’s triangulation method satisfies TIN’s triangulation method satisfies the Delaunay criterion the Delaunay criterion

TIN Data Structures TIN Data Structures

Delaunay Criterion The triangulation algorithm ensures that the Delauney criterion is satisfied. The Delauney criterion is such that the circumcircle of a triangle does not enclose a node on any other element. The circumcircle of a triangle is the circle that passes through its vertices.

Components of a TIN Components of a TIN

 TIN is a set of adjacent, non-overlapping triangles computed from irregularly spaced points with x, y coordinates and z values.

 ArcGIS supports two data models of elevation: TIN

and lattice.

Nodes Nodes Edges Edges Triangles Triangles HullHull Topology Topology

 Nodes:

 Edges:

 Triangles:

Nodes: nodes are the fundamental building blocks of the tin. The nodes nodes are the fundamental building blocks of the tin. The nodes originate from the points and arc vertices contained in the input data originate from the points and arc vertices contained in the input data sources. sources. every node is joined with its nearest neighbors by edges to form Edges: every node is joined with its nearest neighbors by edges to form triangles which satisfy the Delaunay criterion. Each edge has two nodes, triangles which satisfy the Delaunay criterion. Each edge has two nodes, but a node may have two or more edges. but a node may have two or more edges. Triangles: Each triangular facet describes the behavior of a portion of the Each triangular facet describes the behavior of a portion of the tin’s surface. tin’s surface.

 Topology:

 Hull:Hull: The hull of tin is formed by one or more polygons containing the entire The hull of tin is formed by one or more polygons containing the entire set of data points used to construct the tin. The hull polygons define the set of data points used to construct the tin. The hull polygons define the zone of interpolation of the tin. zone of interpolation of the tin. Topology: The topological structure of a tin is defined by the relationship The topological structure of a tin is defined by the relationship between nodes, edges (number and type), and by the relationship between nodes, edges (number and type), and by the relationship between adjacent triangles. between adjacent triangles.

 Like an ARC/INFO coverage, a tin is stored as a directory of Like an ARC/INFO coverage, a tin is stored as a directory of files. However, a tin is not a coverage and has no associated files. However, a tin is not a coverage and has no associated INFO files. A tin directory contains seven files containing INFO files. A tin directory contains seven files containing information about the tin surface. These files are encoded in information about the tin surface. These files are encoded in binary format and are not readable by standard text display. . binary format and are not readable by standard text display

Components of TIN (in detail) Components of TIN (in detail)

© ESRI, Modeling Our World

© ESRI, Modeling Our World

Text or labels plotted on a map which describe or identify a feature or add other information to the map.

Annotation Annotation

TextText

 Information placed on a map to provide Information placed on a map to provide additional information about a graphic feature. additional information about a graphic feature.

Symbols Symbols  The means by which graphic features are The means by which graphic features are displayed in a GIS. displayed in a GIS.

Annotation Annotation

Symbols help identify features and Symbols help identify features and provide information about them. provide information about them. q c , P X q c , P X

Symbols Symbols

Annotation, Symbology, and Text Annotation, Symbology, and Text

Courtesy Village of Garden City

 Engineering drawing – building outline Engineering drawing – building outline  USGS Quad Sheet – point location USGS Quad Sheet – point location

 CityCity

 USGS Quad Sheet – Polygon of city boundary USGS Quad Sheet – Polygon of city boundary  Rand McNally Map – Point representing city location Rand McNally Map – Point representing city location

Issues of Scale in Feature Issues of Scale in Feature Representation Representation Depending upon the presentation scale, a feature Depending upon the presentation scale, a feature may take on different forms may take on different forms  Buildings Buildings

Topology Topology

A collection of numeric data which clearly A collection of numeric data which clearly , and containment, and describes adjacency describes

adjacency, , containment

between map features connectivity between map features connectivity and and which can be stored and manipulated by a which can be stored and manipulated by a computer. computer.

© Paul Bolstad, GIS Fundamentals

Topology Topology

ArcView is a file based, non topological, ArcView is a file based, non topological, pseudo object-oriented graphic data pseudo object-oriented graphic data structure structure

ARC/INFO vs. ArcView ARC/INFO vs. ArcView ARC/INFO is a topologically based hybrid ARC/INFO is a topologically based hybrid system system

ARC/INFO Coverage ARC/INFO Coverage ArcView Shape ArcView Shape Geodatabase Layer Geodatabase Layer AutoCAD DWG/DXF AutoCAD DWG/DXF MapINFO MapINFO OtherOther

File Formats File Formats

 The shapefile format defines the geometry and The shapefile format defines the geometry and attributes of geographically-referenced features attributes of geographically-referenced features in as many with specific file extensions that in as many with specific file extensions that must be stored in the same project workspace. must be stored in the same project workspace. They are: They are:  .shp - the file that stores the feature geometry. .shp - the file that stores the feature geometry. Required. Required.  .shx - the file that stores the index of the feature .shx - the file that stores the index of the feature geometry. Required. geometry. Required.  .dbf - the dBASE file that stores the attribute .dbf - the dBASE file that stores the attribute information of features. Required. information of features. Required.  .sbn and .sbx - the files that store the spatial index of .sbn and .sbx - the files that store the spatial index of the features. Optional. the features. Optional.

The Shape File The Shape File

© ESRI, Modeling Our World

© ESRI, Modeling Our World

Homework Homework

Reading: Reading: A PDF document on data models A PDF document on data models Exercise: Exercise: GTK 5abcd (Symbolizing features and rasters) GTK 5abcd (Symbolizing features and rasters)