Part of Speech Tags 3

A frequently asked question is “What do the Part of Speech tags (VB, JJ, etc) mean?” The bottom line is that these tags mean whatever they meant in your original training data. You are free to invent your own tags in your training data, as long as you are consistent in their usage. Training data ...

Support for SciPy in NLTK’s Maximum Entropy methods

Recently I have been working with the Maximum Entropy classifiers in NLTK. Maximum entropy models are similar to the well known Naive Bayes models but they allow for independence between the features – i.e. they are not “naive”. SciPy has had some problems with its Maximum Entropy code, and v0.8 must be used. v0.9 crashes ...

The NLP Stack

Processing natural language is a complicated business. Not that long ago it seemed to be an intractable problem to many people. Although full understanding is still a cutting edge research problem, large areas of natural language processing have become practical on most computing platforms. This is especially true when NLP techniques are applied to restricted ...

Why Python and NLTK?

Most modern natural language processing (NLP) depends heavily on statistics and complex statistical models. So why use Python,  a relatively slow scripting language,  for NLP? Python’s strengths are in its text, list, and structure support. Structures are weakly typed, but supported by a powerful set of language constructs in the form of list comprehensions and ...

Technical Overview: Marble

Marble is an open source digital globe application that is capable of displaying a number of local and Internet data sources. Intended for educational applications, Marble could be likened to an “open source Google Earth”. Marble is cross-platform and has versions for the KDE environment and Qt.

Generating Google Map Heatmap Mashups from your geo-related data

de/numerics have released a web service for creating heatmap translucent color overlays of geographic-based data on Google maps. No programming is required, but this service can be accessed using a PHP API. This article shows you how to use this service using the PHP programming interface.

An Overview of OpenAddresses.org

The aim of OpenAddresses.org is to establish a copyright-free database of geocoded addresses, using donations and voluntary help. Such a database has a wide range of applications including spatial analysis and geomarketing. The database is intended to be of greater accuracy than navigation systems which typically have insufficient accuracy for microgeographic analysis. OpenAddresses includes a ...

Technical Overview: OpenScales

OpenScales is an open source mapping framework for ActionScript 3 and Flex. Supporting a range of OGC data protocols, it has many similarities to the OpenLayers framework but uses Flash rather than JavaScript. The combination of Flash, ‘tweening’, and fast data providers results in a smoother, more compelling user experience – remniscent of Bing Map’s ...

Technical Overview: GeoNames

GeoNames is a series of web services to an extensive geographical database of over 8,000,000 names. The database is also available for “bulk” download. Both access methods are under the Creative Commons attribution license. The web services are available through free and commercial interfaces, and using a number of different format/protocols.

Technical Overview: Google Maps Data API

The Google Maps Data API is a web service that allows client applications to view, store, and update  map data for Google Maps, using the Google Data API. Data is stored in the form of individual features and collections of features. Full pricing information has yet to be released, but the service is currently free ...