~ubuntu-branches/ubuntu/raring/readosm/raring


Viewing changes to mainpage.doxy

Committer: Package Import Robot
Author(s): David Paleino
Date: 2012-10-07 17:24:29 UTC
Revision ID: package-import@ubuntu.com-20121007172429-lv8oyiu086t7henm

Tags: upstream-1.0.0a+dfsg1

Import upstream version 1.0.0a+dfsg1

files added:

AUTHORS

COPYING

ChangeLog

Doxyfile.in

INSTALL

Makefile.am

Makefile.in

NEWS

README

aclocal.m4

config.guess

config.h.in

config.sub

configure

configure.ac

depcomp

examples

examples/Makefile.am

examples/Makefile.in

examples/examples.doxy

examples/test_osm1.c

examples/test_osm2.c

examples/test_osm3.c

headers

headers/Makefile.am

headers/Makefile.in

headers/readosm.h

headers/readosm_internals.h

headers/readosm_protobuf.h

install-sh

ltmain.sh

m4/libtool.m4

m4/ltoptions.m4

m4/ltsugar.m4

m4/ltversion.m4

m4/lt~obsolete.m4

mainpage.doxy

makefile.vc

missing

nmake.opt

readosm.pc.in

src/Makefile.am

src/Makefile.in

src/osm_objects.c

src/osmxml.c

src/protobuf.c

src/readosm.c


mainpage.doxy

/** \mainpage notitle

\section Introduction

ReadOSM is an open source C library that extracts valid data from an Open
Street Map input file. Such OSM files come in two different formats:
- files identified by the .osm suffix are plain XML files.
- files identified by the .pbf suffix contain the same data, but encoded in
  Google's Protocol Buffer serialization format (a more concise, compressed
  binary notation that requires much less storage space).
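The two suffixes above are enough to tell the formats apart. The following is only a minimal sketch of such a check, not the code ReadOSM itself uses: the enum osm_format values and the guess_osm_format() helper are hypothetical names introduced for this illustration, and the comparison is case-sensitive.

```c
#include <string.h>

/* Hypothetical format codes for this sketch; they are not ReadOSM constants. */
enum osm_format { OSM_FORMAT_UNKNOWN = 0, OSM_FORMAT_XML, OSM_FORMAT_PBF };

/* Guess the OSM file format by looking only at the last suffix of the path. */
enum osm_format guess_osm_format(const char *path)
{
    const char *dot;

    if (path == NULL)
        return OSM_FORMAT_UNKNOWN;
    dot = strrchr(path, '.');   /* last '.' in the path, if any */
    if (dot == NULL)
        return OSM_FORMAT_UNKNOWN;
    if (strcmp(dot, ".osm") == 0)
        return OSM_FORMAT_XML;
    if (strcmp(dot, ".pbf") == 0)
        return OSM_FORMAT_PBF;
    return OSM_FORMAT_UNKNOWN;
}
```

With this sketch a name such as italy.osm.pbf is classified as Protocol Buffer, while a still compressed map.osm.bz2 is reported as unknown.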

The ReadOSM design goals are:
- to be simple and lightweight
- to be stable, robust and efficient
- to be easily and universally portable
- to make the whole parsing process of both .osm and .pbf files
  completely transparent from the application's own perspective.

ReadOSM is structurally simple and quite lightweight (typically about 20K of object
code, stripped). ReadOSM has only two key dependencies:
- zlib (the well known compression library), which is used to decompress the compressed binary
  blocks stored within .pbf files.
- expat (a widely used XML parsing library), which is used to parse XML .osm files.

Both libraries are widely available on many platforms.

Building and installing ReadOSM is straightforward:

\verbatim

./configure

make

make install

\endverbatim

Linking ReadOSM to your own code is usually simple:

\verbatim

gcc my_program.c -o my_program -lreadosm

\endverbatim

On some systems you may have to provide a slightly more complex arrangement:

\verbatim

gcc -I/usr/local/include my_program.c -o my_program \
  -L/usr/local/lib -lreadosm -lexpat -lz

\endverbatim

ReadOSM also provides pkg-config support, so you can also do:

\verbatim

gcc `pkg-config --cflags readosm` my_program.c -o my_program `pkg-config --libs readosm`

\endverbatim

I originally developed ReadOSM simply to allow SpatiaLite's
own CLI tools to import both .osm and .pbf OSM files interchangeably.
However, I feel that supporting OSM file import/parsing in a simple and easy
way could be useful to many other developers, so I quickly decided to
implement all of this as a stand-alone library.

ReadOSM is licensed under the MPL tri-license terms: you are free to choose the
best-fit license among:

- the MPL 1.1

- the GPL v2.0 or any subsequent version

- the LGPL v2.1 or any subsequent version

Enjoy, and happy coding

/** \page intro About Open Street Map datasets

Open Street Map aka \b OSM [http://www.openstreetmap.org/] is a very popular
community project aiming to produce a map of the world; this map is completely
free and is released under the CC-BY-SA license terms
[http://creativecommons.org/licenses/by-sa/2.0/].

Selected portions [by Country / Region] of the OSM map are available on the

following download sites:

- http://download.geofabrik.de/

- http://downloads.cloudmade.com/

The best known format used to ship OSM datasets is based on XML; we'll
briefly examine the general XML layout in order to explain the objects used
by the OSM data model and their mutual relationships.

\section Node

A Node simply corresponds to a 2D POINT Geometry; the geographic coordinates
are always expressed as Longitude and Latitude (corresponding to SRID 4326).
Besides its geometry, a Node is usually characterized by several data
attributes:
- \b id: a number uniquely identifying each Node object.
- \b lon and \b lat: the geographic Longitude and Latitude of the Point.
- \b version: a progressive number identifying subsequent versions of the same object.
- \b changeset: a progressive number identifying a "changeset", i.e. a batch insert/update
  performed by the same user.
- \b user: nickname of the user committing the changeset.
- \b uid: a number uniquely identifying the user.
- \b timestamp: commit date-time.
- \b tag-list: any object may optionally be further qualified using arbitrary \b key:value pairs.

The following is the XML general layout used to represent a Node object:

\verbatim

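<node id="..." lat="..." lon="..." version="..." changeset="..." user="..." uid="..." timestamp="...">
    <tag k="..." v="..." />
</node>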

\endverbatim
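To make the attribute list above more concrete, here is a minimal sketch of how an application might hold a Node in memory. The struct osm_tag and struct osm_node types and the osm_node_tag_value() helper are hypothetical names chosen for this illustration; they are not the structs declared by readosm.h.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical types for this sketch; they are not the ReadOSM structs. */
struct osm_tag {
    const char *key;
    const char *value;
};

struct osm_node {
    long long id;               /* unique Node identifier */
    double lon, lat;            /* geographic coordinates (SRID 4326) */
    int version;
    long long changeset;
    const char *user;
    int uid;
    const char *timestamp;
    size_t tag_count;           /* number of items in tags */
    const struct osm_tag *tags; /* arbitrary key:value pairs */
};

/* Return the value stored under key, or NULL when the Node has no such tag. */
const char *osm_node_tag_value(const struct osm_node *node, const char *key)
{
    size_t i;

    for (i = 0; i < node->tag_count; i++) {
        if (strcmp(node->tags[i].key, key) == 0)
            return node->tags[i].value;
    }
    return NULL;
}
```

A linear scan is enough here because a Node usually carries only a handful of tags.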

\section Way

A Way corresponds to a 2D LINESTRING Geometry; however, its vertices are never directly
defined within the Way itself: the Way instead holds a list of indirectly referenced Nodes (<nd ref> items).
The data attributes characterizing a Way are more or less the same as those used for Nodes, with identical meaning;
Ways also support an arbitrary collection of Tags (\b key:value pairs).

The following is the XML general layout used to represent a Way object:


\verbatim

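<way id="..." version="..." changeset="..." user="..." uid="..." timestamp="...">
    <nd ref="..." />
    <nd ref="..." />
    <nd ref="..." />
    <tag k="..." v="..." />
    <tag k="..." v="..." />
</way>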

\endverbatim
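Because a Way only stores Node references, an application has to look up each referenced Node to obtain the actual LINESTRING vertices. The following is a minimal sketch of that step, assuming the Nodes have already been collected into an array; struct osm_point, find_node() and resolve_way() are hypothetical names, not part of the ReadOSM API.

```c
#include <stddef.h>

/* Hypothetical type for this sketch: a Node reduced to id and coordinates. */
struct osm_point {
    long long id;
    double lon, lat;
};

/* Find a Node by id with a linear scan; returns NULL when it is missing. */
const struct osm_point *find_node(const struct osm_point *nodes,
                                  size_t node_count, long long id)
{
    size_t i;

    for (i = 0; i < node_count; i++) {
        if (nodes[i].id == id)
            return &nodes[i];
    }
    return NULL;
}

/*
 * Resolve the Node references of a Way into vertices.
 * out must have room for ref_count items. Returns the number of
 * references resolved; resolution stops at the first missing Node.
 */
size_t resolve_way(const struct osm_point *nodes, size_t node_count,
                   const long long *refs, size_t ref_count,
                   struct osm_point *out)
{
    size_t i;

    for (i = 0; i < ref_count; i++) {
        const struct osm_point *p = find_node(nodes, node_count, refs[i]);
        if (p == NULL)
            break;
        out[i] = *p;
    }
    return i;
}
```

The linear scan keeps the sketch short; for real datasets with millions of Nodes a sorted array or a hash table would be a more sensible choice.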

\section Relation

A Relation is a complex object: it can correspond to a 2D POLYGON, to a 2D MULTILINESTRING, or even to a 2D GEOMETRYCOLLECTION.
A Relation object can reference any other kind of OSM object: each <member> item can address a Node object,
a Way object or another Relation object; the \b type attribute always specifies the nature of the referenced object,
and the optional \b role attribute may further specify its intended purpose.
The data attributes characterizing a Relation are exactly the same as those used for Ways, with identical meaning;
Relations also support an arbitrary collection of Tags (\b key:value pairs).

The following is the XML general layout used to represent a Relation object:


\verbatim

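<relation id="..." version="..." changeset="..." user="..." uid="..." timestamp="...">
    <member type="..." ref="..." role="..." />
    <member type="..." ref="..." role="..." />
    <tag k="..." v="..." />
    <tag k="..." v="..." />
</relation>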

\endverbatim
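Since each member carries a type and an optional role, a common first step when interpreting a Relation is to select its members by those two attributes. The sketch below assumes a hypothetical struct osm_member and a hypothetical count_members() helper (neither comes from readosm.h); the "outer" and "inner" role names mentioned afterwards follow the usual OSM multipolygon convention.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical types for this sketch; they are not the ReadOSM structs. */
enum osm_member_type { OSM_MEMBER_NODE, OSM_MEMBER_WAY, OSM_MEMBER_RELATION };

struct osm_member {
    enum osm_member_type type;  /* nature of the referenced object */
    long long ref;              /* id of the referenced object */
    const char *role;           /* optional: NULL when absent */
};

/*
 * Count the members of the given type.
 * When role is not NULL only members with exactly that role are counted.
 */
size_t count_members(const struct osm_member *members, size_t member_count,
                     enum osm_member_type type, const char *role)
{
    size_t i, found = 0;

    for (i = 0; i < member_count; i++) {
        if (members[i].type != type)
            continue;
        if (role != NULL
            && (members[i].role == NULL || strcmp(members[i].role, role) != 0))
            continue;
        found++;
    }
    return found;
}
```

For example, count_members(members, n, OSM_MEMBER_WAY, "outer") would report how many Ways form the outer rings of a multipolygon Relation.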


/** \page formats Open Street Map file formats

There are two distinct formats used to ship OSM datasets: both contain exactly the same
information, but the internal layout is radically different.

\section osm XML (.osm) files

OSM files based on the XML notation are widely used: usually they are identified by the .osm suffix.
XML is notoriously verbose and usually requires lots of storage space; fortunately, XML compresses very well.
For this reason, the most commonly found OSM files are identified by the .osm.bz2 suffix:
this simply means that the .osm (XML) file has been compressed using bzip2.
Processing a .osm.bz2 OSM file always requires a two-step approach:
- decompressing the file (using bunzip2 or some other tool)
- then parsing the resulting .osm file

Please note: the decompressed file will require about 10 to 15 times the space required
by the compressed file; many OSM XML files can be really huge (several GB).
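The rule of thumb above can be turned into a quick estimate of the disk space needed before decompressing. This is only a rough sketch: the 10 and 15 multipliers are simply the figures quoted in the text, while struct size_range and estimate_inflated_size() are hypothetical names.

```c
/* Multipliers taken from the "about 10 to 15 times" rule of thumb in the text. */
#define OSM_BZ2_RATIO_MIN 10ULL
#define OSM_BZ2_RATIO_MAX 15ULL

/* Hypothetical result type: lower and upper bound of the estimate. */
struct size_range {
    unsigned long long min_bytes;
    unsigned long long max_bytes;
};

/* Estimate the size of the .osm file obtained from a .osm.bz2 file. */
struct size_range estimate_inflated_size(unsigned long long compressed_bytes)
{
    struct size_range range;

    range.min_bytes = compressed_bytes * OSM_BZ2_RATIO_MIN;
    range.max_bytes = compressed_bytes * OSM_BZ2_RATIO_MAX;
    return range;
}
```

Under these assumptions a 2 GB .osm.bz2 file would need roughly 20 to 30 GB of free space once decompressed.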


\section pbf Protocol Buffer (.pbf) files

An alternative OSM file format is based on Google's Protocol Buffer encoding
[https://developers.google.com/protocol-buffers/docs/encoding].
This OSM format is based on a public and documented specification: [http://wiki.openstreetmap.org/wiki/PBF_Format]

OSM files based on Protocol Buffer encoding are usually identified by the .pbf suffix.
The main benefit of .pbf files is that they are much more compact
(smaller size) than the corresponding .osm.bz2 files, and they can be parsed immediately,
with no preliminary decompression step required.
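Much of the compactness of Protocol Buffers comes from the variable-length integer ("varint") encoding described in the encoding document linked above. The following sketch decodes one base-128 varint and undoes the ZigZag mapping that Protocol Buffers use for signed sint fields; decode_varint() and zigzag_decode() are hypothetical names written for this illustration and are not taken from the ReadOSM sources.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Decode one base-128 varint from buf (at most len bytes are read).
 * Each byte contributes its low 7 bits, least significant group first;
 * a set high bit means that another byte follows.
 * On success stores the value in *value and returns the bytes consumed;
 * returns 0 when the input is truncated or longer than 10 bytes.
 */
size_t decode_varint(const unsigned char *buf, size_t len, uint64_t *value)
{
    uint64_t result = 0;
    size_t i;

    for (i = 0; i < len && i < 10; i++) {
        result |= (uint64_t) (buf[i] & 0x7F) << (7 * i);
        if ((buf[i] & 0x80) == 0) {
            *value = result;
            return i + 1;
        }
    }
    return 0;
}

/* Undo ZigZag encoding: 0, 1, 2, 3, ... map back to 0, -1, 1, -2, ... */
int64_t zigzag_decode(uint64_t n)
{
    return (int64_t) (n >> 1) ^ -(int64_t) (n & 1);
}
```

For instance, the two bytes 0xAC 0x02 decode to the value 300, which is the example used in the Protocol Buffer encoding documentation.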


\section readosm Why use ReadOSM?

The intended scope of ReadOSM is to allow transparent parsing of both OSM formats.
There is no need to take care of any internal low-level aspect, because the library itself silently handles every required step.
The simple abstract interface implemented by ReadOSM is intended to let
reader applications consume OSM input files in the most painless way, while requiring only a
very limited memory footprint.


/** \page readosm ReadOSM basic architecture

ReadOSM implements a very simple and straightforward interface; there are only three functions:
- readosm_open(): establishes a connection to an OSM input file.
- readosm_close(): terminates a previously established connection.
- readosm_parse(): a single function driving the whole parsing process (mainly based on callback functions).

Given the above, implementing a complete OSM parser is very simple:


\verbatim

#include <readosm.h>

static int
parse_node (const void *user_data, const readosm_node * node)
{
/* callback function consuming Node objects */
  struct some_user_defined_struct *my_struct =
    (struct some_user_defined_struct *) user_data;

  /* ... some smart code ... */

  return READOSM_OK;
}

static int
parse_way (const void *user_data, const readosm_way * way)
{
/* callback function consuming Way objects */
  struct some_user_defined_struct *my_struct =
    (struct some_user_defined_struct *) user_data;

  /* ... some smart code ... */

  return READOSM_OK;
}

static int
parse_relation (const void *user_data, const readosm_relation * relation)
{
/* callback function consuming Relation objects */
  struct some_user_defined_struct *my_struct =
    (struct some_user_defined_struct *) user_data;

  /* ... some smart code ... */

  return READOSM_OK;
}

int
main (void)
{
/* the basic OSM parser implementation */
  int ret;
  const void *handle;
  struct some_user_defined_struct my_struct;

  ret = readosm_open ("path-to-some-OSM-file", &handle);

  /* ... error handling intentionally suppressed ... */

  ret = readosm_parse (handle, &my_struct, parse_node, parse_way, parse_relation);

  /* ... error handling intentionally suppressed ... */

  ret = readosm_close (handle);

  /* ... error handling intentionally suppressed ... */

  return 0;
}

\endverbatim
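To show what a callback body might look like without depending on the library, here is a self-contained mock of the same pattern. Everything in it is hypothetical and exists only for this sketch: struct mock_node, mock_parse() and the MOCK_OK / MOCK_ABORT codes merely imitate the shape of the interface shown above, and the rule that a non-zero return stops the loop is a choice made for this mock.

```c
#include <stddef.h>

/* Hypothetical return codes for this mock; they are not ReadOSM constants. */
#define MOCK_OK 0
#define MOCK_ABORT 1

/* Hypothetical stand-in for a parsed Node. */
struct mock_node {
    long long id;
    double lon, lat;
};

typedef int (*mock_node_callback) (const void *user_data,
                                   const struct mock_node *node);

/* User-defined state shared between callback invocations. */
struct node_stats {
    size_t count;
    double min_lat, max_lat;
};

/* Callback: count the Nodes and track the latitude range seen so far. */
int collect_stats(const void *user_data, const struct mock_node *node)
{
    struct node_stats *stats = (struct node_stats *) user_data;

    if (stats->count == 0 || node->lat < stats->min_lat)
        stats->min_lat = node->lat;
    if (stats->count == 0 || node->lat > stats->max_lat)
        stats->max_lat = node->lat;
    stats->count++;
    return MOCK_OK;
}

/* Mock driver: hands every Node to the callback, stopping on a non-zero return. */
int mock_parse(const struct mock_node *nodes, size_t node_count,
               const void *user_data, mock_node_callback node_fnct)
{
    size_t i;

    for (i = 0; i < node_count; i++) {
        if (node_fnct != NULL) {
            int ret = node_fnct(user_data, &nodes[i]);
            if (ret != MOCK_OK)
                return ret;
        }
    }
    return MOCK_OK;
}
```

The point of the design is that the driver never needs to know what the application does with each object: all application state travels through the opaque user_data pointer.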

So the real programming work is simply implementing the code of the callback functions.
The sample code in the Examples section is a useful source of further information about this topic.