Automating a meme: Compound Movies

Feb 14, 2011 00:05

Earlier tonight, dylanbeattie started tweeting a few things with the #compoundmovies hashtag. The basic idea is that you take two films where the first ends with the same word that the second starts with, and mash them together. Some of the results are pretty funny.
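As a rough illustration of that rule (a minimal sketch, not the actual generator script), the matching can be done by indexing titles on their first word. The function name `compound_titles` is made up for this example, and skipping single-word titles and ignoring case are assumptions of mine; punctuation is not normalised at all.

```python
from collections import defaultdict


def compound_titles(titles):
    """Yield mash-ups where one title's last word is another title's first word."""
    # Index multi-word titles by their lower-cased first word for quick lookup.
    by_first_word = defaultdict(list)
    for title in titles:
        words = title.split()
        if len(words) > 1:
            by_first_word[words[0].lower()].append(title)

    for title in titles:
        words = title.split()
        if len(words) < 2:
            continue  # assumption: single-word titles only reproduce the other title
        for other in by_first_word.get(words[-1].lower(), []):
            if other != title:
                # Drop the shared word from the second title before joining.
                yield " ".join(words + other.split()[1:])


if __name__ == "__main__":
    sample = ["Live and Let Die", "Die Another Day", "From Time to Time", "Time Bandits"]
    for compound in compound_titles(sample):
        print(compound)
```

Indexing by first word means each title needs only one dictionary lookup instead of a comparison against every other title, which matters once the list gets long.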

Of course, this lends itself to automation, and my suggestion of trawling IMDB got the response from him that they've already supplied the data for me, so no trawling needed. However, IMDB contains far too many movies, most of which have names I don't even know the meanings of, and generating the compounds from the IMDB data would take an insane amount of time. There's also Wikipedia, though, which, despite some complaints I'd heard about its API, appears to be pretty easy to use.
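For a flavour of what talking to Wikipedia involves, here is a sketch of listing the pages in a category through the MediaWiki `categorymembers` query. It is not taken from the generator script; the helper names (`category_query_url`, `parse_page`, `fetch_category_titles`) and the User-Agent string are hypothetical, and it assumes the English Wikipedia endpoint.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the English Wikipedia API endpoint.
API_URL = "https://en.wikipedia.org/w/api.php"


def category_query_url(category, cmcontinue=None):
    """Build the query URL for one batch of pages in a category."""
    params = {
        "action": "query",
        "list": "categorymembers",
        "cmtitle": "Category:" + category,  # caller passes the name without "Category:"
        "cmtype": "page",                   # skip subcategories and files
        "cmlimit": "500",
        "format": "json",
    }
    if cmcontinue:
        params["cmcontinue"] = cmcontinue
    return API_URL + "?" + urllib.parse.urlencode(params)


def parse_page(payload):
    """Return (titles, continuation token or None) from one decoded response."""
    members = payload.get("query", {}).get("categorymembers", [])
    titles = [member["title"] for member in members]
    token = payload.get("continue", {}).get("cmcontinue")
    return titles, token


def fetch_category_titles(category):
    """Yield every page title in the category, following continuation tokens."""
    token = None
    while True:
        request = urllib.request.Request(
            category_query_url(category, token),
            headers={"User-Agent": "compound-movies-sketch/0.1"},  # hypothetical
        )
        with urllib.request.urlopen(request) as response:
            titles, token = parse_page(json.load(response))
        yield from titles
        if not token:
            break
```

Something like `fetch_category_titles("British_films")` would then produce the raw titles. Page titles can carry disambiguation suffixes such as "(film)", which this sketch leaves in place.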

So, here we go: a #compoundMovies generator. Either run it as "compoundGenerator.py imdb" if you've got lots of time, or as "compoundGenerator.py wikipedia" followed by a category name (without the "Category:" bit). I tried it out on "British_films" and got back a few interesting things:
  • Elizabeth: The Golden Age of Heroes
  • Every Home Should Have One of Our Dinosaurs Is Missing
  • Frankenstein and the Monster from Hell Drivers
  • From Time to Time Bandits
  • Harry Potter and the Chamber of Secrets & Lies
  • Live and Let Die Another Day
  • March or Die Screaming, Marianne
  • Rosencrantz & Guildenstern Are Dead Cool
  • Reluctant Bride and Prejudice
I then found "English-language_films", which gets us:
  • 10 Things I Hate About You Only Live Once
  • No Country for Old Men in Black
  • Nudist Colony of the Dead Ringers
  • Abbott and Costello in the Foreign Legion of the Dead
  • Jaws: The Revenge of the Nerds III: The Next Generation
(That gets 173k entries, so this is only a sample.)

Anyone else got other good categories? Note that this will also work on other things that aren't movies...