Simon Willison blogs at http://simon.incutio.com/

17-year-old claims responsibility for Twitter worm

12/04/2009 @ 21:00 UTC

Simon Willison : 17-year-old claims responsibility for Twitter worm - 17-year-old claims responsibility for Twitter worm. It was a textbook XSS attack: the URL on the user profile wasn’t properly escaped, allowing an attacker to insert a script element linking out to externally hosted JavaScript which then used Ajax t
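
For readers unfamiliar with this class of bug, here is a minimal sketch of the kind of escaping the excerpt says was missing. It is not Twitter's code: the function name `render_profile_link` and the example URLs are hypothetical, and it only assumes that a user-supplied URL is written into an HTML attribute.

```python
from html import escape


def render_profile_link(url: str) -> str:
    """Return an anchor tag for a user-supplied profile URL.

    Hypothetical helper for illustration only. The URL is escaped
    before being placed in the href attribute and the link text, so a
    double quote or angle bracket in the input cannot close the
    attribute and start a new element.
    """
    safe = escape(url, quote=True)  # escapes &, <, >, " and '
    return '<a href="{0}">{0}</a>'.format(safe)


# A made-up malicious value of the kind described in the excerpt.
malicious = 'http://example.com/"><script src="http://evil.example/x.js"></script>'
print(render_profile_link(malicious))
```

Escaping alone does not cover every case (a `javascript:` URL would still need a scheme check), so treat this as an illustration of the attribute-escaping step rather than a complete defence.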

Tweenbots: Cute Beats Smart

12/04/2009 @ 15:00 UTC

Simon Willison : Tweenbots: Cute Beats Smart - Tweenbots: Cute Beats Smart. How do you build a robot that can get from one end of Washington Square Park to the other without your help? Give it a cute smile and a sign explaining where it’s going and rely on strangers to point it in the right direct

Rod Begbie : Tweenbots: Cute Beats Smart - Humans instinctively help a dumb-but-cute robot on its journey across a park. I'd like to see these in a race. Perhaps against one un-cute robot with a frowny face.

A rev="canonical" HTTP Header

12/04/2009 @ 14:00 UTC

Simon Willison : A rev="canonical" HTTP Header - A rev="canonical" HTTP Header. Chris Shiflett proposes optionally exposing rev=canonical information in an HTTP header, thus allowing sites to discover shorter URLs using just a HEAD request and removing the need to parse HTML. The pingback specific
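
A rough sketch of what discovery via a HEAD request could look like. The header name `X-Rev-Canonical` is an assumption made for illustration (the excerpt does not name the header), and both function names are hypothetical.

```python
import urllib.request
from typing import Mapping, Optional


def short_url_from_headers(
    headers: Mapping[str, str],
    header_name: str = "X-Rev-Canonical",  # assumed name, not from the excerpt
) -> Optional[str]:
    """Return the short URL advertised in the response headers, if any.

    Header names are compared case-insensitively, as HTTP requires.
    """
    wanted = header_name.lower()
    for name, value in headers.items():
        if name.lower() == wanted:
            value = value.strip()
            return value or None
    return None


def fetch_short_url(url: str, header_name: str = "X-Rev-Canonical") -> Optional[str]:
    """Issue a HEAD request and look for the short URL header.

    No response body is downloaded, so no HTML has to be parsed.
    """
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request, timeout=10) as response:
        return short_url_from_headers(dict(response.headers.items()), header_name)
```

Calling `fetch_short_url("http://example.com/some/long/path")` would return the advertised short URL, or `None` when the site does not send the header.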

Revving up

12/04/2009 @ 14:00 UTC

Simon Willison : Revving up - Revving up. Jeremy Keith advocates adding the rev="canonical" attribute to regular A elements as well as / instead of hiding it in the head of the document, following the microformats design principle that invisible metadata is less valuable than augmentin

Running Rhino and Helma NG on Google App Engine

12/04/2009 @ 14:00 UTC

Simon Willison : Running Rhino and Helma NG on Google App Engine - Running Rhino and Helma NG on Google App Engine. Helma NG is a JavaScript web app framework, which now works on App Engine out of the box.

Using Scala with Google App Engine

11/04/2009 @ 17:00 UTC

Simon Willison : Using Scala with Google App Engine - Using Scala with Google App Engine. Scala works, but I haven’t seen confirmation on actors yet (which are likely to break due to their dependency on threads).

Digg Search: Now With 99.987% Less Suck

11/04/2009 @ 00:00 UTC

Simon Willison : Digg Search: Now With 99.987% Less Suck - Digg Search: Now With 99.987% Less Suck. Really nice implementation of faceted search, still using Lucene and Solr under the hood.

Experiences deploying a large-scale infrastructure in Amazon EC2

10/04/2009 @ 11:00 UTC

Simon Willison : Experiences deploying a large-scale infrastructure in Amazon EC2 - Experiences deploying a large-scale infrastructure in Amazon EC2. “At OpenX we recently completed a large-scale deployment of one of our server farms to Amazon EC2. Here are some lessons learned from that experience.”

Scaling Django web apps on Apache

10/04/2009 @ 11:00 UTC

Simon Willison : Scaling Django web apps on Apache - Scaling Django web apps on Apache. Cool to see this kind of article cropping up on IBM developerWorks, but it’s a shame they don’t mention mod_wsgi.

Browsing my browsing

10/04/2009 @ 10:00 UTC

Simon Willison : Browsing my browsing - Browsing my browsing. Roo Reynolds used the MeeTimer Firefox extension to gather statistics on his browsing habits, then extracted data directly from the SQLite database and generated his own graphs using PHP and the canvas element.

Andy Baio : Roo Reynolds visualizes his web browsing history - he analyzed data from MeeTimer, a lovely Firefox extension for self-tracking

#DataJourn part 1: a new conversation

9/04/2009 @ 12:00 UTC

Simon Willison : #DataJourn part 1: a new conversation - #DataJourn part 1: a new conversation. Journalism.co.uk report on the first instance of a Guardian story that was driven by an external developer’s work with data originally released on our Datablog.

Sphinx 0.9.9-rc2 is out

8/04/2009 @ 15:00 UTC

Simon Willison : Sphinx 0.9.9-rc2 is out - Sphinx 0.9.9-rc2 is out. Interesting new feature: the Sphinx search server now supports the MySQL binary protocol, so you can talk to it using a regular MySQL client library and fire off search queries using SELECT syntax and the new SphinxQL query lang

Dynamic languages on Google App Engine - an overview

8/04/2009 @ 15:00 UTC

Simon Willison : Dynamic languages on Google App Engine - an overview - Dynamic languages on Google App Engine—an overview. Ola Bini’s notes on exploring the new Java support for App Engine with the aim of getting JVM dynamic languages such as JRuby running. Restrictions include a complete lack of threads (which will ma

App Engine: Scheduled Tasks With Cron

8/04/2009 @ 15:00 UTC

Simon Willison : App Engine: Scheduled Tasks With Cron - App Engine: Scheduled Tasks With Cron. Cron tasks simply hit a URL on your application, and can be run as frequently as once a minute. They made up their own syntax, which is much nicer than traditional Unix cron.

Finding similar items with Amazon Elastic MapReduce, Python, and Hadoop streaming

7/04/2009 @ 11:00 UTC

Simon Willison : Finding similar items with Amazon Elastic MapReduce, Python, and Hadoop streaming - Finding similar items with Amazon Elastic MapReduce, Python, and Hadoop streaming. Tutorial for running Hadoop jobs on Elastic MapReduce using Python and the 2005 Audioscrobbler dataset.

Making the HTML5 time element safe for historians

6/04/2009 @ 15:00 UTC

Simon Willison : Making the HTML5 time element safe for historians - Making the HTML5 time element safe for historians. PPK presents a detailed history of dates and calendars and points out that the HTML5 time element is ill prepared to faithfully represent the kind of dates historians are interested in.

Twitter: blaming Ruby for their mistakes?

6/04/2009 @ 12:00 UTC

Simon Willison : Twitter: blaming Ruby for their mistakes? - Twitter: blaming Ruby for their mistakes? The comments on the entry include replies from Twitter employees and the RabbitMQ consultant they brought in, and provide a full rebuttal to the various accusations of NIH that were thrown around recently.

Ext Core 3.0 Beta Released

5/04/2009 @ 22:00 UTC

Simon Willison : Ext Core 3.0 Beta Released - Ext Core 3.0 Beta Released. The Ext JavaScript team have just released the core library (similar to jQuery or Prototype) under an MIT license. The rich GUI elements that go on top are still under the GPL.

Building Sites Around Social Objects (Live from Web 2.0) - ReadWriteWeb

5/04/2009 @ 21:00 UTC

deusx : Building Sites Around Social Objects (Live from Web 2.0) - ReadWriteWeb - "This morning at the Web 2.0 Expo in San Francisco, Jyri Engeström, co-founder of Jaiku and now Google employee, spoke about building sites around social objects. What this means is that the social sites we visit today are not just friend networks -

Simon Willison : Building sites around social objects - Building sites around social objects. Jyri Engeström’s concept of “social objects” is a genuinely useful new way to talk about social web sites.

Almost Perfect

5/04/2009 @ 21:00 UTC

Simon Willison : Almost Perfect - Almost Perfect. W. E. Peterson’s book on the rise and fall of WordPerfect Corporation, originally published in 1994 and now available for free online.

UK Television Series Map

5/04/2009 @ 21:00 UTC

Simon Willison : UK Television Series Map - UK Television Series Map. Inspired by the US sitcom map, Meg Pickard is plotting TV series on a map of the UK.

Mending The Bitter Absence of Reasoned Technical Discussion

5/04/2009 @ 21:00 UTC

Simon Willison : Mending The Bitter Absence of Reasoned Technical Discussion - Mending The Bitter Absence of Reasoned Technical Discussion. Not at all surprised to see Alex Payne write this considering the low quality of discussion around anything technical to do with Twitter.

TinyURL - Archiveteam

4/04/2009 @ 01:00 UTC

Simon Willison : TinyURL - Archiveteam - TinyURL—Archiveteam. Excellent: the Internet Archive are crawling TinyURL (and hopefully other URL shortening services as well). The wiki page was created back in January.

Introducing Digg's IDDB Infrastructure

3/04/2009 @ 22:00 UTC

Simon Willison : Introducing Digg's IDDB Infrastructure - Introducing Digg’s IDDB Infrastructure. IDDB is Digg’s new infrastructure component for sharding data across multiple databases, with support for both MySQL and memcachedb. “The DiggBar and URL minifying service is powered by a 16 machine IDDB clu

deusx : Digg the Blog » Blog Archive » Introducing Digg’s IDDB Infrastructure - "The DiggBar and URL minifying service is powered by a 16 machine IDDB cluster, which includes 8 write masters in the index and 8 MySQL storage nodes. It’s, to date, the largest IDDB cluster Digg has pushed into production, but we have plans for mu

Automating PowerPoint with Python

3/04/2009 @ 17:00 UTC

Simon Willison : Automating PowerPoint with Python - Automating PowerPoint with Python. Useful tutorial on using ActivePython’s win32com module to automate PowerPoint. The example code pulls in the top 50 banks by assets from the Guardian Data Store and generates a treemap using PowerPoint’s shape dra
