A Spatio Temporal Model for Massive Analysis of Shapefiles
Over the course of time an organization working with geospatial data accumulates tons of data both in the form of vector and raster formats. This data is a result of co ordinated processes within the organization and external sources such as other collaborative organizations, projects and agencies, crowd sourcing efforts, etc. The massive amount of data accumulated as a result and the recent developments in the distributed processing of geo data have catalyzed the development of our spatio temporal data processing model. Our model is loosely based upon GS Hadoop and uses dataset consisting of more than 300,000 shapefiles a vector data format , which was accumulated over a span of several years from various sources and creation of custom geo portals for government departments. The developed model provides access to visual representation, extraction of features and related attributes from over more than 800 GB of shapefile data containing ten of billions of features. In this paper, we model a spatial data infrastructure for processing such huge amount of geo data.
Shapefiles spatio temporal processing spatial processing feature extraction
Dr. Prabha Shreeraj Nair