tevps.net - Blog - Automating a meme: Compound Movies

Automating a meme: Compound Movies

14 Feb 2011

Humour

Earlier tonight dylanbeattie started tweeting a few things with the #compoundmovies hashtag. Basic idea is that you take two films, where the first ends with the same word as the first word of the second and mash them together, and some of them are pretty funny.

Of course, this obviously lends itself to automation, and my suggestion of trawling IMDB got the response from him that they've already supplied the data for me, so no trawling needed. However, IMDB contains far too many movies, most of which have names I don't even know the meanings for, and generating the data for this from the IMDB data will take an insane amount of time. However, there's also Wikipedia, which despite some complaints I'd heard about it's API, appears to be pretty easy to use.

So, here we go, a #compoundMovies generator. Either run it as "compoundGenerator.py imdb" if you've got lots of time, or as "compoundGenerator.py wikipedia <some category name>" (without the "Category:" bit). I tried it out on "British_films" and got back a few interesting things:

Elizabeth: The Golden Age of Heroes
Every Home Should Have One of Our Dinosaurs Is Missing
Frankenstein and the Monster from Hell Drivers
From Time to Time Bandits
Harry Potter and the Chamber of Secrets & Lies
Live and Let Die Another Day
March or Die Screaming, Marianne
Rosencrantz & Guildenstern Are Dead Cool
Reluctant Bride and Prejudice

I then found "English-language_films" which gets us

10 Things I Hate About You Only Live Once
No Country for Old Men in Black
Nudist Colony of the Dead Ringers
Abbott and Costello in the Foreign Legion of the Dead
Jaws: The Revenge of the Nerds III: The Next Generation

(That gets 173k entries, so it's only a sampling).

Anyone else got other good categories? Note that this will also work on other things that aren't movies...

Previously: Repository crawlers for Mercurial (or why you need to learn about revsets) Next: Data visualisation: How weird is our jukebox?

Comments

With an account on the Fediverse or Mastodon, you can respond to this post. Since Mastodon is decentralized, you can use your existing account hosted by another Mastodon server or compatible platform if you don't have an account on this one. Known non-private replies are displayed below.