November 26, 2012 by yorksranter

A Project Lobster progress report!

So I completely forgot I needed to register for OKFN’s Open Interests Europe hackathon last weekend, which even had a lobbying track, and just round the corner from the office, too.

I decided to have my own lobbying hackathon by ~~eating pizza and caffeine pills and being misogynistic~~ spending my weekend finishing the Lobster Project’s analytics scrapers for ministers and lobbies respectively. I abandoned the plan of generating NetworkX objects and storing them in the database for later use in favour of directly generating them and reading out the metrics, and dealing with the performance hit by writing slightly less horrible code.

Specifically, I decided to optimise for fewer calls to the database API. Memoising the rankings function cuts its usage from two calls per meeting to 82 calls in total for the first month, plus any future changes, and storing the cache itself means that only new combinations of ministers and titles generate a query in future runs. Getting all the lobbies for the month in one query, and then processing them in Python using itertools, replaces one query for each meeting with one admittedly complex query per month and a small function.
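For anyone wanting to try the same trick, here is a minimal sketch of the two ideas, not the actual scraper code. The names `PersistentMemo`, `lobbies_by_meeting` and the JSON cache file are all made up for illustration, and the rankings lookup is assumed to be a function of a (minister, title) pair that would otherwise hit the database.

```python
import itertools
import json
from pathlib import Path


class PersistentMemo:
    """Memoise a lookup keyed on (minister, title) and keep the cache between runs.

    Hypothetical helper: `func` stands in for whatever actually queries the database.
    """

    def __init__(self, func, cache_path):
        self.func = func
        self.cache_path = Path(cache_path)
        self.calls = 0  # how many times the underlying lookup really ran
        if self.cache_path.exists():
            self.cache = json.loads(self.cache_path.read_text())
        else:
            self.cache = {}

    def __call__(self, minister, title):
        # JSON object keys must be strings, so encode the pair as one.
        key = json.dumps([minister, title])
        if key not in self.cache:
            self.calls += 1
            self.cache[key] = self.func(minister, title)
        return self.cache[key]

    def save(self):
        # Storing the cache means later runs only query for unseen pairs.
        self.cache_path.write_text(json.dumps(self.cache))


def lobbies_by_meeting(rows):
    """Group (meeting_id, lobby) rows from one monthly query by meeting.

    itertools.groupby only merges adjacent items, so sort by the key first.
    """
    ordered = sorted(rows, key=lambda row: row[0])
    return {
        meeting: [lobby for _, lobby in group]
        for meeting, group in itertools.groupby(ordered, key=lambda row: row[0])
    }
```

The point of the design is that the per-meeting loop never touches the database: it reads from the memo and from the grouped dictionary, and the only queries left are the monthly one and the cache misses.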

This still took far longer than I expected to run, but then I realised there was more data than I had allowed for.

Anyway, they work and they are generating results by month, so we will be able to draw nice time series charts, up to September 2011. Unfortunately, the ScraperWiki datastore is doing something quite weird – replacing float values with nulls or zeroes – and although I thought I might have fucked up type declarations, pragma tells me that the column types are what they ought to be. So I’ve got a query outstanding with the ScraperWiki folk.
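The kind of check described above can be sketched locally with Python's built-in `sqlite3` module. This assumes the datastore behaves like plain SQLite, which the pragma suggests but which I have not confirmed here; the `metrics` table and its columns are hypothetical stand-ins, not the real schema.

```python
import sqlite3

# In-memory database standing in for the datastore (assumption: SQLite semantics).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE metrics (minister TEXT, month TEXT, centrality REAL)")
conn.execute(
    "INSERT INTO metrics VALUES (?, ?, ?)",
    ("Example Minister", "2011-09", 0.125),
)

# PRAGMA table_info yields (cid, name, type, notnull, dflt_value, pk) per column.
declared = {row[1]: row[2] for row in conn.execute("PRAGMA table_info(metrics)")}

# typeof() reports the storage class of the value actually held in the row.
stored = conn.execute(
    "SELECT centrality, typeof(centrality) FROM metrics"
).fetchone()

print(declared["centrality"], stored)
```

If the declared type is `REAL` but `typeof()` comes back as `null` or `integer` for values that went in as floats, the problem is somewhere between the insert and the store rather than in the type declarations.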

