The web speaks

Mar 04, 2009 01:41

Here's the 'best' of the first half-dozen 'poems' from my new program: it babbles at random according to a Markov model built from the most common bigrams in Google's whole-web corpus, constrained to the form of a Shakespearean sonnet.

(not safe for work)
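For anyone curious how the babbling works, here is a minimal sketch of the idea, not the program's actual code. It assumes a tiny hand-made bigram table (`BIGRAMS`, hypothetical; the real counts come from the web corpus) and picks each next word at random in proportion to how often it follows the current one. The only sonnet constraint it enforces is 14 lines of a fixed word count; real meter and rhyme would need pronunciation data, which this sketch leaves out. The names `next_word` and `babble_sonnet` are made up for illustration.

```python
import random

# Hypothetical toy bigram counts: BIGRAMS[w1][w2] = how often w2 follows w1.
# The real program draws these from a web-scale corpus instead.
BIGRAMS = {
    "the": {"web": 5, "night": 3, "world": 2},
    "web": {"speaks": 4, "of": 2},
    "speaks": {"of": 3, "the": 1},
    "of": {"the": 6, "love": 2},
    "night": {"of": 2, "speaks": 1},
    "world": {"of": 3},
    "love": {"the": 1},
}


def next_word(word, rng):
    """Sample a follower of `word`, weighted by bigram count."""
    followers = BIGRAMS.get(word)
    if not followers:
        # Dead end (no known follower): restart from any known word.
        return rng.choice(list(BIGRAMS))
    words = list(followers)
    weights = [followers[w] for w in words]
    return rng.choices(words, weights=weights, k=1)[0]


def babble_sonnet(rng, n_lines=14, words_per_line=6):
    """Random-walk the bigram chain, breaking it into 14 fixed-length lines."""
    word = rng.choice(list(BIGRAMS))
    lines = []
    for _ in range(n_lines):
        line = [word]
        while len(line) < words_per_line:
            line.append(next_word(line[-1], rng))
        lines.append(" ".join(line))
        # Carry the chain across the line break.
        word = next_word(line[-1], rng)
    return lines


if __name__ == "__main__":
    for text in babble_sonnet(random.Random()):
        print(text)
```

Passing in a seeded `random.Random` makes a run repeatable, which helps when picking the 'best' of a batch.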

dariusk March 4 2009, 13:29:52 UTC
Oh, that is amazing. Is the source up anywhere yet? (I've been working on a different sort of sonnet-generator myself.)

dariusk March 4 2009, 13:31:32 UTC
Err, I just realized you're probably accessing that 6-DVD set of data, though?

darius March 4 2009, 18:41:48 UTC
Not directly, though I might go and buy it. The data's from someone at Google who wrote a great article about some of the cool things you can do with that dataset -- but the article hasn't been published yet so I don't know if I can say more or give out this condensation of the data.

darius March 4 2009, 18:36:49 UTC
Thanks! I guess I'll put the code up on github as it is now (an ugly mess). Different how?

dariusk March 4 2009, 18:39:50 UTC
I helped a poet I know write something that takes existing sonnets and does some simple regex to kind of rearrange them in interesting ways.

darius March 4 2009, 18:43:32 UTC
Ah, cool. If it weren't simple I wouldn't have the patience.

darius March 4 2009, 18:57:17 UTC
