Archive for the 'data' Category

Faux Data: Infographics

The theoretical physicist Geoffrey West criticised accepted thought in urban theory before coming up with a set of constants that define the relationship between a city’s size and the output of its citizens (each time a city doubles in size, its per capita innovation, income, etc. increase by 15% – and so do negative outcomes such as crime and pollution). Previously he found a similar efficiency in biology, where the larger an organism is, the less energy per unit mass it requires to go about its life.
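To make the "15% per doubling" figure concrete, here is a minimal sketch of the arithmetic. It takes the figure quoted above at face value and assumes a pure power law in which per capita output grows as city size raised to a fixed exponent; the function names are my own, and this is not West's actual model or data.

```python
import math

# Assumption: per capita output follows y(N) = c * N**gamma, and doubling
# the city size N multiplies y by 1.15 (the 15% figure quoted above).
PER_DOUBLING_GAIN = 0.15


def per_capita_exponent(gain_per_doubling: float) -> float:
    """Exponent gamma such that 2**gamma == 1 + gain_per_doubling."""
    return math.log2(1.0 + gain_per_doubling)


def per_capita_multiplier(size_ratio: float,
                          gain_per_doubling: float = PER_DOUBLING_GAIN) -> float:
    """Factor by which per capita output changes when a city grows by size_ratio."""
    return size_ratio ** per_capita_exponent(gain_per_doubling)


if __name__ == "__main__":
    gamma = per_capita_exponent(PER_DOUBLING_GAIN)
    print(f"per capita exponent: {gamma:.3f}")
    # Total output is per capita output times population, hence exponent 1 + gamma.
    print(f"total output exponent: {1 + gamma:.3f}")
    for ratio in (2, 4, 10):
        print(f"city {ratio}x larger -> per capita x{per_capita_multiplier(ratio):.3f}")
```

Under this assumption, two doublings compound: a city four times the size has per capita output 1.15 × 1.15 ≈ 1.32 times higher, not 1.30.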

It’s this track record in reducing a problem domain to a simple set of rules and constraints that is so impressive: the way in which theoretical physicists go about solving for x, the sense of minimalism that drives the crunching of gigabytes of data and a seemingly chaotic environment into understandable, predictable systems. It’s raw data visualisation in its purest form.

And then we have the humble infographic.

How to Opt Out of Targeted Behavioural Advertising


Behavioural advertising involves tracking a web user’s surfing and displaying advertising that matches this data. Personally, I find the tracking of my surf history unnecessarily intrusive, and today I found an online tool that will prevent marketing companies from collecting this data and profiting from it:

http://www.networkadvertising.org/managing/opt_out.asp

Incidentally I came by this information by way of Rapleaf,

When to do Real Time

Image courtesy jayce 31

Google has done two ‘real-time’ things lately, one good and one not so good: real-time web indexing and real-time web search.

With ‘er, hang on a minute…’ moments now surfacing in the public domain, I find the contrast between the two to be especially important. Google, in their traditional engineer style, expound the benefits of both in shaving seconds off search: ‘11 user hours saved globally each second’; ‘50% faster indexing rate of content’; figures that prove the mantra – machines search better than humans.

Machines definitely do the donkey work better than humans.

Google App Engine Datastore Gotchas

stormclouds gather
image courtesy johnson7

App Engine is generally a new paradigm for webapp developers; replacing sessions with memcache and adopting a schemaless datastore are just two elements requiring new thinking for old problems. Unfortunately there are a few more hidden nuisances which have the potential to waste programming time relatively early on. Here are four of my personal head-bangers:

1. The datastore doesn’t always store properties
I’ve had trouble with it refusing to store arbitrary entity properties unless I assign them in the entity constructor itself (these fields were optional, by the way). Just setting property values after initialisation and then calling put() on the datastore didn’t write them.
